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ABSTRACT 


An  exposition  of  the  potentiality  of  production 
systems  as  6  mode!  of  the  detailed  control  structure 
of  humans.  Contains  a  detailed  treatment  of  the  ele¬ 
mentary  Sternberg  reaction  time  experiments  in  binary 
classification  as  a  means  of  exhibiting  the  uses  of 
production  systems.  Leau*  to  a  hypothesis  fcr  these 
ixperiments  different  rom  the  usual  one  of  exhaustive 
search,  called  the  Decoding  Hypothesis. 
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PRODUCTION  SYSTEMS:  MODELS  OF  CONTROL  STRUCTURES 
Allen  Newell 

C  rnepU*~Mellon  University 

A  production  system  is  a  scheme  for  specifying  an 
information  processing  system.  It  consists  of  a  set 
of  productions,  each  production  consisting  of  a  con¬ 
dition  and  an  action.  Tt  has  also  n  collection  of 
data  structures:  expressions  that  encode  the  infor¬ 
mation  ^pon  which  the  production  system  works— on  which 
„he  actions  operate  and  on  which  the  conditions  can  be 
determined  to  he  true  or  false. 

A  production  system,  starting  with  an  initially 
Riven  ser  of  data  structures,  operates  as  follows. 

That  production  whose  condition  is  true  of  the  current 
data  (assume  there  is  only  one)  is  executed,  that  is, 
the  action  is  tak'n.  The  result  is  to  modify  the  cur¬ 
rent  data  structures.  This  leads  in  the  next  instant 
to  another  (posjibly  the  same)  production  being  executed, 
leading  to  sti'.l  further  modi  f  teat  ion.  So  ft  goes, 
action  aftei  <ctlon  being  taken  to  carry  out  an  entire 
program  of  processing,  each  evoked  hy  its  condition 
becoming  true  of  the  aomentarily  current  collection  of 
data  structures.  Tne  entire  process  halts  either  when 
no  condition  is  true  'hence  nothing  is  evoked)  or  when 
an  action  containing  a  stop  operation  occurs. 

Much  remains  to  be  specified  in  the  above  scheme 
to  yield  a  definite  information  processing  system.  What 
happens  (*»  likely  occurrence)  if  more  then  one  produc¬ 
tion  is  satisfied  at  once?  What  is  the  actual  scheme 
for  encoding  information?  What  sort  of  collection  of 
data  structures  constitutes  the  current  state  of  knowl¬ 
edge  or  which  the  system  works?  What  sort  o*  tests 
are  expressible  in  the  conditions  of  productions?  What 
sort  of  primitive  operations  are  performable  cn  the  data 
and  what  collections  of  these  are  expressible  in  the 
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actions  of  productions?  What  sorts  of  additional 
.memories  are  available  and  how  ore  they  accessed  .and 
written  into?  How  i:  the  production  system  itself 
modified  from  within,  or  f9  this  possible?  How  much 
time  (or  effort)  is  taken  by  the  various  components  of 
the  system  and  how  do  they  combine  to  yield  a  total 
time  for  an  entire  processing? 

There  are  many  questions  which  can  be  answered  in 
many  different  ways.  Bach  assemblage  of  answers  yields 
a  different  production  system  with  different  properties 
from  its  siblings.  Taken  in  alit  they  constitute  a 
family  of  schemes  for  specifying  information  processing 
systems.  Within  this  family  can  be  found  almost  any 
process  specification  scheme  one  could  like — though  not 
in  fact  all  possible  schemes.  There  are  other  ways  of 
specifying  the  information  processing  to  be  done.  There 
are  languages,  such  as  Algol  and  Fortran,  that  take  as 
their  basis  a  specified  sequence  of  operating-processes 
to  be  performed,  punctuated  by  test-proce6f es  that 
explicitly  direct  processing  to  switch  +  n  mother 
sequence.  There  are  languages,  such  as  SNOBOL,  that 
use  productions  (conditions  associating  to  actions), 
but  each  production  explicitly  switches  the  processing 
this  way  or  that  to  other  sequences  of  production. 

..ook  at  the  situation  a  different  way.  Suppose 
you  know  about  an  information  processing  system:  its 
memories,  its  encodings  and  its  primitive  operations 
(both  tests  and  manipulations).'  Wha*  more  would  you 
require  to  obtain  a  complete  picture?  You  need  to  know 
how  the  system  organizes  these  primitives  into  an  effec¬ 
tive  processing  of  its  Knowledge.  This  additional 
organization  is  called  the  control  struotuve.  Produc¬ 
tion  systems  are  a  type  of  control  structure. 

The  purpose  of  this  paper  is  to  illustrate  the 
possibility  of  having  a  theory  of  the  control  structure 
of  human  information  processing.  Gains  seem  possible 
in  many  forms:,  completeness  of  the  microtheories  of  how 
various  miniscule  experimental  tasks  are  performed;  the 
«bllltv  to  pose  meaningfully  the  problem  of  yhat  method 
»ubj«t:L  i*  usjlh*,'  to.  suggest  s«**.  tpccfl ar_ 

nisew  for  accomplishing  a  task;  the  facilitation  of 
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eon&paring  behavior  on  diverse  task#* 

We  Illustrate  by  actually  proposing  a  theory  of 
the  control  structure.  We  are  in  earnest  about  the 
theory;  in  this  respect  we  ar*  being  more  than  illus¬ 
trative.-  However,  to  be  taken  seriously,  a  theory  of 
control  should  encompass  a  substantially  greater  scope 
of  experiments  than  we  are  able  to  deal  With  here.  This 
also  appears  to  be  the  first  explicit  model  of  the 
control  structure  at  this  level  of  detail.  It  would 
hardxv  seem  that  details  of  the  structure  are  right- 
even  ic  (as  I  currently  believe)  a  production  system 
of  some  sort  appears  to  be  a  suitable  model  of  the 
human  control. 

Our  plan  is  to  present  a  particular  production 
system,  noting  its  psychological  properties,  but  with 
no  attempt  to  defend  it  against  variant  schemes.  Using 
this  system  we  will  conduct  e.n  analysis  of  the  basic 
Sternberg  paradigm,  which  underlies  several  of  the 
experiments  discussed  in  the  present  symposium.  With 
this  basic  analysis  in  hand,  we  will  then  discuss  In 
varying  level?  of  detail  the  potentialiten  of  produc¬ 
tion  system?  as  models  for  human  control  and  the  issues 
raised  thereby. 

PSG:  A  Particular  Production  System 

The  particular  production  system  presented  here, 

PSG  (for  production  system  version  G) ,  was  developed 
as  a  continuation  of  work  with  problem  solving,  in 
crypt-arithraetit  (Newell  &  Sinon,  1972,  Chapters  5-7)., 
The  original  data  that  PSG  was  designed  to  deal  with 
were  about  an  order  of  magnitude  grosser  than  the 
reaction  time  data  that  currently  seem  most  appropriate 
to  defining  the  behavior  of  the  immediate  processor — 
i $ c. ,  it  worked  with  freely  nrodM^d  ^.trases  of  a  few 
second*  dprailo*.*-  A  recent  paper  (Newell,  1972) 
describes  PSG  and  begins  the  task  of  applying  it  to 
the  more  detailed  situation,  focussing  on  the  problem 

fhe^overall  architecture  of  tHtPsystera  is  shown  ^ 
In  Figure  X,  All  of  the  action  in  the  system  takes 
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Fig.  X.  Overall  architecture  of  PSG. 

place  in  the  Short  Term  Memory  (STM) ,  which  contains 
a  set  of  symbolic  expressions.  STM  is  to  be  identified 
with  the  memory  of  Miller  (1956)  and  Waugh  and  Norman 
*1965), 5  its  site  is  some  small  number  of  chunk* 
(proverbially  7+2). 


!We  prefer  not  to  use  the  terms  primary  and  second¬ 
ary  memory  introduced  by  Waugh  and  Norman,  since  the 
terms  conflict  directly  with  their  use  in  computer 
science.  There,  primary  memory  is  the  memory  that  a 
processor  can  access  for.  its  program,  secondary  memory 
being  more  remote  (e.g.,  a  disk  or  magnetic  tape,  see 
Bell  6  Newell,  1971),  What  Waugh  and  Norman  call  pri¬ 
mary  memory  woutu  be  called  •>  scratchpad  memory  or  a 
working  memory,  STM  seems  suitable  as  a  name. 
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There  Is  no  direct  representation  in  PSG  of  the 
various  buffer  memories  that  appear  to  be  part  of  the 
immediate  processor  of  the  human:  the  visual  iron  of 
Sparling  (1960),  (possibly)  the  precategor Jcal  auditory 
store  of  Crowder  and  Morton  (1969),  and  others.  The 
interface  to  the  senses  is  not  represented  as  well,  nor 
is  the  decoding  on  the  motor  side.  Such  deficiencies 
in  the  architectural  model  undoubtedly  limit  the  scope 
and  adequacy  of  the  system,  but  will  not  be  of  first 
importance  in  this  paper. 

The  STM  holds  an  ordered  set  of  symbolic  expres¬ 
sions  (i.e.,  chunks).*  The  ordering  shows  up,  as  will 
be  seen  later,  in  that  new  expressions  always  enter  STM 
at  the  front  and  that  the  conditions  examine  the  expres¬ 
sions  in  order  starting  at  the  front  (hence  the  frontal 
expressions  may  preempt  iater  ones).'  As  can  be  seen  in 
Figure  1,  a  symbolic  expression  may  be  simply  a  symbol 
(e.<>*»  CC?  or  it  may  consist  of  an  ordered  collection 
of  symbolic  expressions  (e.g.,  (EL  (AA  DD))  ).  Thus, 
symbolic  expressions  ma,  be  built  up  in  a  nested 
fashion,  and  we  can  represent  them  in  the  manner  of 
algebraic  expressions.  STM  may  bo  taken  as  holding 
symbol  tokens  (i.e.,  pointers)  to  the  expressions,  or 
it  may  be  taken  as  holding  the  expressions  themselves. 
Operationally,  there  is  no  way  of  telling  the  differ¬ 
ence.  The  degree  to  which  an  element  in  STM  is  opaque 
(Johnson,  1970)  Is  determine;:  by  the  conditions  of  the 
productions,  which  in  essence  are  a  description  of  what 
aspects  of  an  expression  can  be  responded  to. 

The  Long  Terra  Memory  (LTM)  consists  entirely  of  an 
ordered  set  of  productions.  Each  production  is  writte.i 
with  the  condition  on  the  left  separated  from  the  action 
on  the  right  by  an  arrow.  In  Figure  1  only  four  produc¬ 
tions  are  shown,  PD1,  PD2,  PD3  and  PP4.'  Some  of  the 
conditions  (e.g.,  that  of  PDA)  consist  of  only  a  single 
symbolic  expression  (e.g.,  PDA  has  AA) ;  others  have  a 
conjunction  of  two  (e.g.,  PD1  has  AA  and  BB) .  Some 
actions  consist  of  a  single  symbolic  expression  (e.g., 
PD3  with  BB),  some  have  a  sequence  of  expressions 
(e.g.,  PDA  with  CC  followed  by  DD) ,  some  have  expres¬ 
sions  that  indicate  operations  to  be  performed  (e.g,, 
rhe  SAY  in  PD2) . 
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We  will  not,  for  the  purpose*;  of  this  paper,  be 
considering  either  the  question  ol  other  tvper  of  LTM 
or  of  storing  new  information  (new  productions)  in  LTM. 
This  imposes  n  substantial  restriction  on  the  classes 
of  experiments  we  can  consider,  bin.  this  class  still 
includes  many  of  these  in  the  present  symposium.  Our 
assumption  about  LTM  implies  a  form  of  homogeniety,  but 
not  one  that  precludes  having  essentially  distinct  mem- 
crin  *or  (say)  distinct  modalities- -the  distinctive¬ 
ness  arises  from  the  content  of  the  conditions,  not 
from  the  structure-  of  the  memory  itself.  The  creation 
of  new  expressions  in  STM  is  not  to  be  taken  as  creating 
them  in  LTM  3S  well.  Thus  chunking  is  separated  from 
storing  the  chunks  in  LTM  so  they  can  be  retrieved 
later. 

As  the  system  stands  initially,  none  of  the  pro¬ 
ductions  is  satisfied  by  the  contents  of  STM  and  nothing 
happens.  However,  we  have  shown  an  AA  about  to  enter 
into  STM  from  the  external  world.  When  it  does  so  we 
get  the  situation  of  Figure  2.  Here  we  have  shifted 
to  the  representation  of  the  system  we  will  use  from 
now  on.:  All  tne  essential  elements  in  Figure  1  are 
represented,  only  the  various  enclosing  boxes  and 
input/output  arrows  are  missing.  STM  now  holds  the  AA 
and  has  lost  the  TT  from  the  far  right.  (The  STMT  in 
the  figure  is  the  initial  contents  cf  STM.) 

In  Figure  3  we  Miow  the  trace  of  the  run,  as  it  is 
produced  by  the  system. 7  At  each  cycle  the  production 
that  is  true  (i.e.,*  the  first  whose  condition  is  true) 
is  noted,  folio. ed  by  each  action  when  it  is  taken.; 

Then  the  new  state  of  STM  is  printed  and  the  cycle 
repeats.  The  numbers  to  the  left  are  a  count  of  the 
number  of  actions  that  have  occurred  so  far  in  the  run, 

00100  PS  ONE'  (PDI  no?  P03  PO  D 

00?00  j 

0C300  PCI  IAA  ANCB8 ->lOvO»«D 
00400  PC2  ICC  A  NO  BR  ->  (SAY  M>» 

00500  POJ  (00  AND  <ru  >  88) 

00600  FC4  (AA  »  CC  DO) 

00700  » 

00800  S»W  (AA00(EE  )r>PRSS) 

OOSOO  i 

Fig.  2.  F,xampif*  production  -./stem  PS. ONE 
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00100  0  ST*  (AA  QQ  (tf  ff  >IW  SS) 
00200  P04  TRW 
00300  0  ACTION  1C 
00400  I  ACUW  DO 

cos'-  v  s»"»  iu  •* ;? 
oopoo  m3  iwuc 

00/00  2  ALlfc'N  oy 

00*00  3  STM  (PO  DO  (tt  cf )  CC  AA) 

00900  mi  TRW 

O' 000  3  ACTION-  <ClD  •.) 

oiioo  4  stm  «otOAA)eaootttrncc> 

0.200  PC2  TRW 
01300  4  ACTION*  (SAY  M> 

01400 

OlhOO  ihmhni  M 
01600 

01700  5  STM  (CC  SB  (OlO  AA)  DO  (EE  P')) 

01800  PC?  TRUE 

01900  5  ACTON  (SAY  M< 

02000 

C210G  HMIMHt  4 
02200 

02300  6  STM  (CC  8B  l»  0  AA)  00  (ft  FT)) 

02400  P02  TRW 

02500 

Fig.  3.  Run  of  PS. ONE 


Let  us  work  through  the  trace,  explaining  how  the 
conditions  and  actions  operate.  The  only  condition  of 
the  four  productions  satisfied  is  that  of  PD4 ,  the  AA 
on  the  left  side  of  PD4  matching  the  AA  in  STM.  This 
leads  to  the  action  of  PD4  being  evoked,  first  th?  CC 
then  the  DD.,  Notice  that  AA  is  still  in  STM  but  RR  and 
SS  have  disappeared  off  the  end.  This  can  be  seen  in 
Figure  3  at  Line  500  where  the  contents  of  STM  are 
printed  after  all  actions  for  production  PD4  have  been 
taken. 

A  piodtiction  (PDA)  having  been  successfully  evoked, 
the  system  starts  the  cycle  over.  PD4  is  of  course 
still  satisfied  since  AA  is  still  in  STM.  But  is 
also  satisfied  since  the  DD  matches  the  DD  in  STM  and 
the  (EE)  also  matches  the  (EE  (rl  FF))  in  STM.  This 


2PSG  Is  a  programming  svstea  coded  in  <  system 
building  language  called  L*(G)  (see  Newel!,  McCracken, 
Robf  rtson  and  Freeman  (1971)  for  ,m  overview  of  L*(F), 
the  immediate  predecessor  of  L*(G)).,  PSfi  operates  on 
a  PDP10  and  the-  runs  in  this  paper  were  made  on  the 
PDP10  system  of  the  H41’  Computer  Science  Department.; 
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latter  follows  frota  one  of  several  matching  rules  in 
PSG.  This  one  says  that  a  match  occurs  If  the  condi¬ 
tion  matches  completely,  starting  with  the  first  symbol 
in  the  STM  expression  brt  optionally  skipping  some, 

Inus  (EE)  would  alst  match  (EE  (F?  GG)).  but  would  "V't 
luatoh  on  expression  without  EC -at  the  front,  e.g.  , 

(FF  EE) . 

When  two  productions  are  simultaneously  satisfied, 
the  rule  for  resolving  such  conflicts  is  to  take  the 
first  one  in  order — here  PD3.  The  result  of  FDD's 
action  is  to  put  3B  into  STM  .as  shown  at  Step  2. 

Notice  that  when  PD3  was  evoked  the  two  items  in 
its  condition  moved  up  to  the  from  of  STM  in  the  same 
order  as  in  the  condition.  Thus,  attended  items  stay 
current  in  STM,  rhile  the  others  drift  down  toward  the 
end,  ultimately  to  te  lost.-  This  mechanism  provides  a 
form  of  automatic  rehearsal,  though  it  does  not  pre¬ 
clude  deliberate  rehearsal.  It  also  implies  that  the 
order  of  the  items  in  STM  does  not  remain  fixed,  but 
f lops  around  wi  th  toe  details  of  processing. 

At  the  next  cvcle  PD1  is  evoked,-  being  the  first 
of  the  productions  satisfied,  which  includes  PD2,  PD3 
and  PDA.  The  action  of  PD1  introduces  a  basic  encoding 
(l.e.,  construction)  operation.-  (OLD**)  is  a  new 
expression,  which  will  go  u.«.u  STM  like  any  other.  But 
**  is  a  variable  whose  value  is  the  front  elcT.nt  *. 
STM. *  In  the  case  m  point  the  front  element  is  AA, 
which  was  moved  up  bv  the  automatic  rehearsal  when  the 
condition  of  PD1  was  satisfied.  Hence  the  new  element 
is  (OLD  AA) .  This  clement  rcylaaea  the  front  element, 
rather  than  simply  pushing  unto  the  front,  ^he  net 
effect  is  to  take  the  frent  element  ani  embed  it  in  a 
larger  expression.  Any  expression  may  be  written  vith 
**.  For  examole,  if  the  action  or  PDl  had  been  (XX  ** 
(YY  **)),  then  the  new  element  replacing  AA  i«.  STM 


^The  constructive  operation  using  **  is  an  addition 
to  PSG  beyond  Newell  (1972).  There  ve  used  a  replace¬ 
ment  operation  to  modify  STM  elements;,  herr  no  modifi¬ 
cation  is  possible. 
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wol Id  have  been  (XX  AA(YY  AA)),  creating  a  rather 
comp  ex  encoding.  It  is  Important  that  the  AA  no 
*onp(cr  exist  in  STM  (i.e.,  as  the  second  element,  after 
pushing  In  the  code),  since  it  is  necessary  to  modify 
STH  so  AA  cannot  re-evoke  -*  production.. 

The  import  of  PDi's  action  is  that  it  deactivates 
the  STM  item  able  to  e;oke  PJ>4  (and  itself,  as  we1!). 

On  the  next  evcle  only  PP2  is  satisfied.;  Tts  action 
involves  SAY,  which  is  a  primitive  operation  of  the 
system  that  prints  out  the  expression  following  it  in 
the  element,;  i.e.,  tc  prints  HI  (as  shown  in  the 
figure) . 

We  see  from  figure  3  that  the  system  continues  to 
evoke  PD2  and  say  HI.  Nothing  happens  to  modify  STM  8l. 
the  condition  of  PD2  remains  satisfied.  If  we  had 
written? 

PD2:  (CC  AND  BB  — >  (SAY  HI)  (OLD  **)) 

then  the  production  system  would  have  turned  off  by 
marking  CC  as  old. 

We  have  indicated  by  illustration  a  number  of 
details  of  PSG,  enough  to  permit  us  to  turn  to  the 
analysis  of  a  substantive  example,*  The  details  given 
*»„  tar  are  not  sufficient.  There  is  a  somewhat  wider 
err  :y  of  primitive  operations  and  many  more  details  of 
the  matching  operation  for  conditions  (Newell,  1972). 

We  will  introduce  the  additional  aspects  of  this 
specif i  ion  as  required  throughout  the  paper. 

Wo  a  see,  even  at  this  stage,  that  many  assump¬ 

tions  are  required  ro  specify  a  complete  control  struc¬ 
ture.  Some  of  the  ,  such  as  the  STM  itself,  its 
encoding,  nnd  the  automatic  rehearsal,  constitute 
rr^he:  clear  psychology!  postulates.  Cthera,  such 
as  the  details  of  matching  have  psychological  impli¬ 
cations  (presumably  every  aspect  of  the  system  does), 
but  it  is  hard  to  know  how  to  state  them  directly  as 
independent  postulates. 
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The  Sternberg  Paradigm 

Let  us  consider  the  simplest  r, f  ail  binary  clas¬ 
sification  ta*ks  studied  by  Sternberg  (197u),  The 
subject  memorizes  a  small  set  of  symbols,  scy  digits.. 
This  is  called  the  pocitlV*  art.  In  a  trial  M  the 
experiment  proper  the  subject  is  given  a  ready  signal, 
followed  by  a  digit  after  a  short  fixed  delay.  The 
subject  responds  "yes”  if  .this  so-called  probe  digit 
is  a  member  of  the  positive  set,  "no”  if  it  is  not. 

The  "yea”  and  ”no”  responses  are  usually  encoded  into 
button  pressings.  Many  trials  are  given,  so  that  the 
task  become*  well  practiced,  t’*e  goal  being  to  respond 
as  quickly  possible  while  keeping  a  very  row  error 
rate.  The  positive  «et  is  varied  in  blocks,  both  as  to 
size  and  coopo~.li  ion.  The  Measure  taken  is  the  re¬ 
sponse  time  (RT)  f^c-n  presentation  of  the  signal  to 
response,  measured  in  milliseconds  (ms'» .. 

The  results  of  this  experiwnt  are  well  known  and 
ferm  a  basis  for  a  number  of  the  experiments  which  arc 
discuss'd  oy  Posner  (Chapter  2)  and  Hayes  (Chapter  <4) 
in  the  present  symposium.  Let  us  just  summarize  the 
basic  findings: 

(1)  Response  time  is  linear  with  the  size 
of  the  positiv  set,  the  slope  being  In 
tins  range  of  35-40  os.  The  natural 
interpretation  is  that  a  search  ir  made 
through  th  positive  set. 

(2)  The  intercept  is  of  tb«  order  of  350 
ras,  but  absolute  magnitude  is  never 
analysed  in  detail  since  iv  contains 
several  unknown  components  (e.a,f 
motor  response  time). 

(3)  The  size  of  the  positive  set  can  be 
be  up  to  the  sise  normally  associated 
with  STM,  i.e.,  around  seven  elements. 


u  - 
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(4)  The  slope  for  negative  responses  (when 
the  probe  digit  is  not  ir  the  set)  Is 
the  same  as  for  positive  responses 
(when  the  probe  is  in  the  set).  This 
violates  the  results  exported  if  the 
search  is  terminated  whenever  it  found 
the  probe  in  the  sot  (which  would  make 
the  positive  set  appear  to  be  on  the 
average  only  half  os  large  in  the  case 
of  positive  responses;.  This  gives 
rise  to  an  interpretation  of  so-called 
exhaustive  search  (as  opposed  to  so- 
called  self-terminating  search). 

(5)  There  is  essentially  no  serial  position 
effect  (the  time  it  cakes  to  respond 

to  a  positive  prebe  as  a  function  of 
where  in  the  positive  9et  the  probe 
digit  occurs).  This  agrees  with  the 
exhaustive  search  notion. 

(6)  The  negative  response  can  differ  from 
the  positive  response  by  a  constant 
amount  (independent  of  set  size,  so  the 
two  linear  curves  lie  parallel).  The 
amount  ia  usually  about  50  ms,  depending 
on  experimental  conditions. 

Much  more  is  known  about  this  simple  task,  a  full 
list  including  all  the  qualifications  to  the  above 
vovlj  probably  run  to  a  hundred  statements,  rather  than 
the  six  abr”e.  The  basic  results  are  highly  reproduc¬ 
ible  and  robust.  The  total  set  of  results,  however, 

Is  by  no  means  easily  seen  to  be  consistent  with  any 
simple  mod' ’ 

We  can  use  this  paradigm  to  Illustrate  concretely 
what  a  model  of  the  control  system  xuvolves  and  how  it 
mat  as  contact  with  experimental  data.  Since  we  want  to 
reveal  the  strengths  and  issues  with  respect  to  produc¬ 
tion  '.yoteme  we  will  not  simply  present  a  final  system, 
but  will  proceed  by  a  process  of  step-wise  refinement. 


I?  - 
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We  will  work  our  wa«  through  a  series  of  productiun 
systems  until  we  arrive  at  one  that  seems  appropriate 
to  Che  task  and  the 


PS.STT:  Tnriediate  Recognition 

The  obvious  scheme,  shown  in  Figure  •' ,  is  for  STM 
to  contain  the  positive  set,  whence  the  p  obe  ite  Intro** 
duced,  leading  to  the  attempt  at  identification.  We 
cannot  just  have  digits  as  the  elements  in  STM,  since 
we  need  to  distinguish  the  probe  digit  from  the  posi¬ 
tive  set  digits.  Thus,  we  encode  the  digits  of  the  set 
as  (ELM  <DIGfT*»),  where  <DIGIT>  means  that  any  digit 
can  go  in  that  place,  j.g.,  (ELM  5);  likewise  we  encode 
the  probe  as  (PROBE  <DIGIT>) .  The  class  <D1GIT>  is 
defined  explicitly  at  the  top  of  the  figure.  STM  is 
Initialized  with  a  set  of  three  elements  and  a  ready 
signal  (Line  1500).  This  latter  simply  controls  the 
response  to  attend  to  the  stimulus. 

The  labeling  of  items  is  responsive  to  a  general 
Issue.  STM  may  contain  various  odd  expressions  from 
a  diversity  of  sources.;  The  subject  must  (normally) 
he  able  to  distinguish  the  relevant  itemt,  from  the 
irrelevant.  For  instance,  the  positive  set  might  con¬ 
sist  of  2,  3,  4  and  the  subject  (say)  become  aware  of 
the  digit  5  upon  a  final  rehearsal,  so  that  5  is  in  STM 
upon  presentation  of  the  probe.  We  would  not  expect 
the  subject  simply  to  take  5  as  a  member  of  the  positive 


<J>GIT»  (CLASS  omm  8  9) 

AMY  (VAR) 

RfSfW*  (ACTION  tffttt  (PT  VONSt  ANY))  (SAY  ANY)  (OLD  *♦» 
ATTfNO  (OPH  CAU  «Oir,{B) 

I 

PSSU-(fO)  TTOPOJPfW 

POI  <'P«0ur )  AND  (010  (fif  SWSH  -  »  (01 0  »»)) 

P0?<  «PR0()C  -nwT')  AfO(CUI  <OOT»)  (BtSPONSt  Yf$) 
RfSiXYlO) 

P03  «rK  *»  AND  (<IU)  -->  (PfSPr/iST  NO)  WVONO) 
P04(WAiV  >AT1fW» 
l 

ST  Ml?  (PC  DYrHW  |ll(!W«)(f‘M5|!A  NO 


?ig.  ft.  PK.ST1:  Immediate  recognition. 
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set,  though  whether  some  additional  processing  would  be 
called  for  is  not  clear.  In  any  event,  the  general  use 
of  codes  that  declare  Che  nature  of  the  item  seems  to 
be  appropriate  and  we  will  do  it  throughout,  without 
making  apodal  arguments  each  time. 

The  production  system,  PS.ST1,  consists  of  four 
productions.  The  performance  of  the  task  is  accom¬ 
plished  by  PD2  and  PD3.  PD2  is  satisfied  if  there  is 
an  ELM  and  a  PROBE  both  of  which  hAVe  the  sane  digit. 
Thua  the  occurrence  of  the  class  name  <DIGIT>  in  an 
expression  operates  as  a  variable  to  match  against  the 
actual  items  in  STM.  The  action  of  PD2  Is  to  put  into 
STM  a  response  expression  (in  this  case  to  respond  YES) 
and  then  to  fire  as  operator,  RESPOND.  This  operator, 
shown  at  the  top  of  the  figure  at  Line  A00,  consists 
of  a  sequence  of  actions,  l.e.,  essentially  the  right 
side  of  a  production.1*  Thera  are  three  actions  in 
RESPOND.  The  first  action  is  to  notice  anywhere  in  STJ' 
an  element  of  the  form  (RESPONSE  ANY),  where  ANY  Is  a 
variable  that  can  take  any  symbolic  expression  as  value 
(it  is  declared  at  the  top  of  the  figure  at  Line  200). 
NTC  is  a  primitive  operation,  that  performs  a  recog¬ 
nition  of  the  sane  sort  as  is  performed  in  the  matching 
on  Che  condition  side.  The  second  action  is  to  say  the 
value  of  ANY,  which  is  accomplished  by  the  SAY  operation 
used  in  PS.Oh'E.  Finally,  RESPOND  marks  the  RESPONSE 
element  old,  so  that  the  system  now  knows  (in  some 
sense)  that  it  has  said  the  response. 

Production  PD3  ia  sensitive  to  the  occurrence  of 
any  ELM  and  any  PROBE,  and  will  respond  with  NO. 


**It  thus  behaves  like  s  subroutine  from  a  control 
point  of  view.  However,  it  works  with  the  same  STM  as 
do  all  ocher  actions,  '.hat  is,  there  is  no  isolation 
ot  its  data,  ao  there  is  for  instance  with  a  gubro'  cine 
for  computing  the  sine,  SIN(X),  which  operates  lr  an 
isolated  environment  where  it  knows  only  about  'he 
value  of  the  passed  operand,  X.  Whether  or  not  sub¬ 
routine  control  occurs  and  whether  or  net  subro  itine 
data  isolation  occurs  are  psychological  questions 
about  the  human  control  system. 


\ 
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However,  It  sits  behind  PD2  snd  thus  will  only  be  evoked 
if  PD2  is  not,  l.e.,  only  if  the  probe  is  not  a  member 
of  the  positive  set.  Thus,  PS.ST1  composes  its  response 
to  the  task  out  of  s  recognition  of  membership  and  a 
recognition  that  it  is  appropriate  to  respond. 

The  other  two  productions  in  PS.ST1  provide  some 
of  the  additional  control  to  cake  the  system  behave. 

PD4  responds  to  READY  as  does  the  opeiator  ATTEND. 

Since  we  have  no  model  of  the  external  environment,  we 
finesse  the  matter  by  having  ATTEND  call  to  the  console 
of  the  user  to  obtain  the  input '  (which  will  be  de¬ 
scribed  in  Figure  5,  coming  up).  PD1  is  an  analog  to 
PD1  in  PS .Uft'E,' Which  serves  to  recognise  that  the  task 
la  done  and  to  encode  this  by  marking  the  PROBE  element. 
The  effect  of  this  is  to  keep  the  system  from  saying 
YES  YES  YES  ....  aa  PS.OKE  keeps  saying  HI  HI  HI  ... 

Figure  5  shows  a  run  of  PS.ST1,  from  which  it  cm 
be  seen  that  the  system  performs  correctly  in  both  the 
positive  and  negative  cases.  The  system  was  reinitial¬ 
ised  for  the  second  trial  (Line  2600).5  6  When  ATTEND 
fires  it  prints  a  message  to  the  user.  The  user  puts 
in  tin  expression  after  the  prompt  and  then  executes  a 
tZ  to  return  control  to  FSG. 

Variables  occur  In  two  places  in  PS.STls  ANY  in 
RESPOND  and  <DIGIT>  in  the  condition  of  PD2..  In  both 
caces  they  are  asa  gned  a  value  during  the  course  of  a 
match,  in  order  to  satisfy  the  match.  But  they  perform 
distinct  functions. 


5Thus  PSG  operates  in  an  essentially  Interactive 
mode.  The  main  gain,  besides  the  usual  one  of  flex- 
lbl.lty,  is  that  there  is  no  nsed  to  program  an  outer 
en  ironment. 

6We  might  have  simply  put  in  another  probe  at  the 
erd  of  the  first  session,  without  reinitializing. 
However,  it  would  not  have  behaved  properly  (why?). 

?Thi«  tZ  is  necessary,  since  the  system  allows  the 
user  to  do  whatever  he  pleases  after  ATTEND  sends  its 
message,  hence  cannot  know  until  Cold  when  the  user  is 
finished  and  wishes  to  return  control  Co  ic. 
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ooioo  pssu  stab:? 

00200  0  STM-  (RfADY  (flW  1)  (ELM  4)  (ClM  9)  NX.  Mil 

00300  P04  TRUE 

00400  0  ACTION-  AT  If  NO 

00*-'*:  ATTEIOtNC  -  WUT  NEXT  9TMULUS  •  (PROOF  4) 

00600  - 

CO  700  1  ACTON-  IPPCQE  4) 

00800  2  STM-  ((PROW  4)  READY  (EtU  l )  'OM  4)  (£IM  9)  ML1 

00900  P02  TRUE 

01000  2.  A'  ION-  (RESPONSE  YES) 

01  tOO  3.  ACTION-  RESPOND 

01200  4  ACTION- (NIC  (RESPONSE  AW?) 

01300  5.  ACTION  t$AY  ANY) 

01400  . 

01900  imhwh  rCS 

01600 

'*  01700  •.  ACTION- (OU) 

018#)  7  STM  ((OUMR£$T>ONSt  YCS»)  (PROOF  4|  (CIM  4)  PCAOY  (ELM  1HB.M  •)) 

01900  POl  TffuC 
02000  7  ACTION-  (010  «♦> 

02100  8.  STM  «aO  (PROBE  4)>(0L0  (RESPONSE  YES))  (£LM  4)  «AOY  (ELM  I  HELM 

02200  P04  TRUE 

02300  8.  ACTION-  ATTEND 

02400  ATTCNDMG  -  INPUT  NEXT  STIMULUS  « 

02500 

02400  PS.ST1  START! 

02700  0  STM  (READY  (ElMt)  (ELM  4)(EtM9>  NX.  ML) 

02800  POO  TRUE 
02900  0  ACTION- ATTEND 

03000  ATTENDtfKl  -  WJT  NEXT  STIMULUS  -  (PN06E  9) 

03100  - 

03200  1  ACTION-  (PROW  I) 

03300  Z  STM- ((PROOC  8)  REAOY  (EUU)  (ELM  4HELU  4>  MU 

03400  P03TRUE 

03500  2  ACTION  (RESPONSE  NO) 

03400  3.  ACTION  RESPOND 

09700  4  ACTION- (MTC  (RESPOf.SF  ANY)) 

03800  B  ACTION- (SAY  ANY) 

03900 

04000  •»«♦«♦«  M5 

04100 

04200 

Fi8.  5.  Run  of  PS.3T1  on  positive  and  negative  cases. 


The  ANY  in  RESPOND  is  used  to  communicate  between 
one  action,  which  sets  the  value  of  '.KY,  and  anrtt.  -r, 
which  needs  to  use  it.  This  communication  of  values 
from  one  action  to  another  occurring  later,  or  from  a 
condition  to  its  action,  implies  the  existence  of  mem¬ 
ory.  By  the  nat—e  of  things,  this  memory  cannot  be 
STM  (which  would  lead  to  an  infinite  regress).  On  the 
ocher  hand,  thie  memory  occurs  only  over  the  sco-ie  of  a 
single  production.  This  is  a  short  time,  providing  we 
restrict  the  time  taken  by  a  '<-ion.  For  instance,  <.e 
should  not  permit  an  entire  p»  iccion  system  to  be 
evoked  by  one  action  before  going  on  to  the  next  action,® 
Thus,  our  control  system  must  posit  a  very  short  tens 
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buffer  memory  in  addition  to  the  STM  working  memory. 

The  M11GIT'  in  PD2  nerves  to  restrict  the  match  to 
work  on  digits  {so  that  e.g.,  (KIM  BOAT)  would  not  be 
recognised).  No  such  restriction  occms  with  ANY, 

More  importan " ,  It  serves  to  enforce  the  equality 
between  two  occurrences  ci  digits,  since  the  value 
assigned  at  the  firsc  place  will  be  used  at  the  second 
and  give  a  catch  only  If  the  same  digit  recurs.  Thus, 
the  multiple  occurrence  is  performing  a  major  function 
of  the  task — the  equality  teat  of  probe  digit  and  member 
digit.  Whether  there  can  be  multiple  occurrences  of 
variables  In  3  condition  is  an  independent  psychological 
question.  To  replace  It  with  the  provision  that  a 
variable  can  occur  but  once  on  the  condition  side  la 
tantamount  to  making  only  Identification  possible 
(Including  class  membership).  This  would  imply  that  a 
primitive  operation  of  equality  testing  would  be  re¬ 
quired,  to  be  used  in  the  action  part..  The  processing 
implications  of  one  assumption  or  the  other  is  unclear, 
since  what  additional  memory  and  control  la  required 
within  the  match  to  accomplish  multiple  occurrences 
depends  on  the  mechanism  used  to  Implement  the  match 
(in  particular  the  amount  and  kind  of  parallelism),9 

How  do  ve  know  this  production  system  is  the  right 
scrt  of  mechanism,  given  the  results  of  experiments? 

We  need  to  adopt  an  explicit  timing  model,  so  that  we 
can  compute  the  total  time  t-ken  in  performing  the  task. 
The  central  assumption  we  will  make  has  three  parts: 


8Sueh  facility  represents  good  programming  language 
design.  In  which  onr  wants  Indefinite  capabilities  for 
recursion.  However,  ve  are  trying  to  model  the  human 
control  system,  not  construct  a  neat  system. 

Hfe  state  all  these  issues  to  show  that  the  con¬ 
ventions,  of  the  production  system,  which  may  appear  to 
be  linguistic  In  nature,  contain  substantive  psycholog¬ 
ical  assumptions. 
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The  time  Co  evoke  tht  next  production  is 

Independent  of: 

(1)  the  number  of  productions  in  the  system! 

(2)  he  contents  of  STM; 

(3)  the  condition  of  the  evoked  productions. 

Th.'.  iisumptionK  are  meant  only  as  a  first  approximation. 
However,  they  do  rule  out  time  being  proportional  to  the 
number  of  productions  (the  assumption  that  comas- nat¬ 
urally  from  the  definition  of  a  production  system  and  1 
its  implementation  on  a  digital  computer).  _  '  " 

In  favor  of  Part  (1)  is  the  circumstance  that  in 
writing  a  production  system  (PS.ST1  op  any  other)  we  - 
only  put  down  a  few  of  the  .traditions  to  which  the 
subject  is  presumably  sensitive  auu  could  respond  to 
if  the  situation  (i.e. ,  the  contents  of  STM)  warranted: 
a  wasp  lighting  on  the  apparatus,  the  smell  of  smoke, 
an  Irrelevant  remark  in  the  background,  turning  off  the 
lights  and  so  on,  any  of  which  would  surely  evoke  a 
noticing  operation  and  subsequent  alteration  of  the 
contents  of  STM.  While  reaction  to  Such  conditions 
might  be  somewhat  longer,  in  no  way  could  the  subject 
be  imagined  to  Iterate  through  all  such  possible  con¬ 
ditions  taking  an  increment  of  time  per  possibility. 

Thus,  the  set  of  productions  we  work  with  bears  no 
relation  to  the  sec  of  productions  that  we  envision 
constituting  the  LTH.  More  generally,  the  basic  control 
structure  is  to  be  viewed  as  one  of  a  recognition  fol¬ 
lowed  by  an  action  followed  by  a  recognition  again— 
the  act  of  evoking  the  next  action  (nr  mini-sequence  of 
actions)  being  the  basic  pulse  of  the  system.11' 

Parte  (2)  and  (3)  of  the  assumption  are  not  quite 
eo  compelling  end  alternatives  can  be  imagined.11  That 


1 ^Thi s  recognition-act  cycle  is  to  be  contrasted 
with  the  basic  fetch-decode-execute  cycle  which  is  the 
primitive  control  structure  of  the  digital  computer. 
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#11  Che  conditions  are  tested  simultaneously  implies 
that  the  time  to  determine  the  next  production  depends 
on  all  the  unsatisfied  conditions  as  well  as  the  one 
that  Is  chosen.  Thus,  no  strong  dependence  could  exist 
on  the  particular  Items,  and  in  any  evenf  all  the  items 
in  STM  must  be  'nvolved  In  the  processing,  not  Just 
thos'  that  enter  into  the  selected  production. 

These  three  assumptions  Imply  that  it  takes  a 
conscant  amount  of  time,  call  it  T.  evoke,  to  determine 
the  next  production  to  be  executed.  Each  production, 
of  course,  evokes  a  sequence  of  actions.  The  total  time 
to  accomplish  the  sequence  nay  be  variable,  depending  on 
the  exact  actions  that  occur.  The  simplest  assumption 
Is  one  of  aetnality:  that  each  action  takes  a  fixed 
amount  of  time  and  that  the  time  for  the  sequence  Is 
the  sum  of  the  times  for  each  action.  Even  simpler  is 
the  assumption  chat  each  action  takes  the  same  time, 
call  it  T. action.  Under  this  assumption  the  time  for 
a  production  with  N  actions  can  be  written; 

T. production  -  T. evoke  +  S  *  T. action 

The  special  case  of  T. evoke  •  0  Is  worth  a  moment's 
attention.  The  obvious  interpretation  is  that  It  takes 
no  time  to  evoke  the  production  (l.e. ,  Co  recognize 
what  action  sequence  to  perform)  and  all  the  time  is 
taken  by  the  performance  of  actions.  An  alternative 
interpretation  Is  that  only  a  single  action  can  be 
evoked  at  a  time.  That  is,  writing  of  a  sequence  of  N 
actions  la  simply  a  shorthand  for  writing  N  productions, 
each  of  which  has  a  condition  and  a  single  action.  We 
assert  thereby  that  the  conditions  are  so  unique  that 
only  the  production  associated  with  the  next  action 
would  fire.  Under  this  assumption  the  total  time  of  a 
production-as-vrttten  with  N  actions  is: 


* ‘For  instance,  considering  elements  in  order  from 
the  front  t«&STM  am!  evoking  the  first  satisfied  produc¬ 
tion  would  make  the  time  dependent  on  the  contents  of 


STH. 
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T. production  »  N  *  (time-to-evoke  +  tirae-for-actior.) 

The  two  times  coalesce  to  form  the  Traction  in  the  top 
formula. 

The  simplicity  of  these  assumptions  should  not  be 
disturbing.  Their  complication  can  be  left  to  the 
impact  of  specific  data.  Even  in  this  simple  form  they 
offer  guidance  in  the  analysis  of  a  production  system. 
Notice,  by  the  way,  that  the  prodjction  system  has  a 
built  in  seriality  In  the  sequence  of  production  evoca¬ 
tions,  independent  of  whether  we  make  the  serial  assump¬ 
tion  for  performing  a  sequence  of  actions  for  a  given 
production.  Roughly  speaking,  the  time  to  do  a  task  is 
proportional  to  the  number  of  productions  evoked  to  do 
the  tesk. 

Given  this  much  of  a  timing  model,  it  can  be  seea 
from  Figure  5  that  PS.ST1  produces  an  answer  in  a  time 
that  is  independent  of  the  size  of  the  positive  set 
(essentially,  T, evoke  +  5*T. action).  Thus  PS.ST1  dis¬ 
agrees  fundamentally  with  the  empirical  results.  Con¬ 
sequently,  let  us  explore  other  methods  for  the  task 
(putting  to  one  side  for  the  moment  what  is  implied  by 
not  using  a  scheme  of  action  that  seems  possible 
a  priori). 

PS,ST2:  Terminating  Search 

Figure  6  shows  a  production  system,  PS.ST2,  that 
performs  the  task  by  explicitly  searching  through  each 
of  the  members  of  the  positive  set,  PD2  in  Figure  6 
looks  very  similar  to  PD2  In  Figure  4.  However,  there 
in  a  critical  difference.  In  Figure  4  the  digit  selec¬ 
ted  by  -’DIGIT'*  is  defined  by  the  probe;  thus  this  seeks 
out  an  clement  in  STM  that  the  same  digit.  In 

ooioo  Past?  <roi  ro?  ro*  rat 
oozoo  » 

ooooo  rai  <(w*w*/f«uniD<wsiWK*)  •*  #a.o  *•» 

00*100  P02  «UM  A?«  <PW*tf  *0K3?*)  ->  {BtSPONSC  «$> 

00500  WSPONO) 

00*00  POO  HELM*  AM)  iPStXK)  «*  0  •»)> 

00700  P04-  «PHC** >  AND  iCIM)  *8S  -*  <3£S**(5M*  US!  JtlWW 
001*00  P05-  (SfApY  ATTEND* 

00*00  t. 

Fig.  6,  PS.ST2;  Linear  terminating  search. 
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Figure  6,  the  digit  is  selected  by  the  first  (ELM. ..) 
in  STM;  only  1c  this  h.*s  the  same  as  the  probe, 

will  there  be  a  match.  If  this  doesn’t  occur,  the  next 
production  (PD3)  then  modifies  the  first  element  so  that 
it  will  not  be  sensed  again  by  PD2.  Thus,  these  two 
productions  work  through  the  positive  set  and  will  find 
a  match  if  it  exists.-  Only  if  no  more  elements  exist, 
will  PDA  be  evoked  and  say  NO.  (PD1  and  PD5  are  iden¬ 
tical  to  PD1  and  PDA  respectively  of  PS.ST1.) 

The  condition  of  PDA  involves  detecting  the  absence 
of  an  element  in  SIM,  indicated  by  the  ABS  following 
the  elemert.  Thus  PDA  will  not  be  evoked  if  there  is 
an  item  in  STM  of  form  (ELM...).  This  happens  not  to 
be  strictly  necessary  for  PS.ST2  to  work,  but  somehow 
providing  a  production  that  could  be  triggered  to  say 
NO  on  the  occurrence  of  the  probe  alone  seems  risky.; 
Suppose,  for  instance,  the  probe  srrived  simultaneously 
with  the  ready  signal.  PS.ST2  would  behave  right;  a 
system  with  only  (PROBE)  in  the  condition  cf  PDA  would 
not,  producing  NO  immediately.- 

We  have  now  introduced  all  but  one  of  the  ingre¬ 
dients  of  matching:  (1)  the  matching  of  items  in  STM; 

(2)  the  conjunction  of  condition  elements,  either  for 
presence  or  absence;  (3)  the  use  of  variables  and  clas¬ 
ses  (which  operate  as  variables  with  restricted  domains); 
and  (A)  the  rules  for  matching  an  element  (or  subelement) 
cf  the  condition  with  an  element  (or  subeleraent)  of  the 
STM,  namely  subeleoent  by  subelement,  working  from  the 
front,  but  allowing  the  tail  of  the  STM  element  to  not 
be  matched  (e.g.,  (EE)  matches  (EE  FF)>.  The  one  addi¬ 
tion  (to  occur  in  the  next  example)  is  f5)  permitting 
a  variable  to  have  an  associated  domain  locally.  An 
example  of  this  is: 

(A  XI  ««  (B  C)  D)  where  XI:  (VAR) 

This  says  that  XI  must  match  (B  C).  Thus  the  entire 
condition  element  matches  (A  (B  C)  D),  but  not  (ABC  D), 
<(B  C)  D)  or  (A  (C  B)  D).- 

Examination  of  the  logic  of  PS.ST2  shows  that  the 
rime  is  indeed  proportional  to  the  sire  of  the  3et 


21 


VISUAL  INFORMATION  PROCESSING 


00100  C  STM-  (READY  (ElM  1)  (flM  4)  (ELM  f1  Ml  f#0 

00 TOO  «»  TRUE 

00300  0  ACTION-  ATTEND 

00400  AT?fNC*NG  -  INPUT  NEXT  STIMULUS  m  (PROSE  4) 

00000  - 

00600  1  ACTION-  (PROBE  4) 

C0700  7  STM  ((PROBE  4)  READY  (ELM  l )  (ElM  4)  (ELM  9)  MU 

ooaoo  rcn  true 

00900  ?  AC’ ION  (OLD**) 

01000  3  SIM  U&DlflM  1»  (PROBE  «>  READY  (ELM  4)  (ElM  9)  Ml) 

01100  P02  TRUE 

01200  3  Av.  mCJ^- (RFSPONSC  YTS: 

01300  4  ACTION  RESPOND 

01400  5  ACTON-  (NTC  (RESPONSE  AW) 

01  BOO  6  ACTION-  (SAT  ANY) 

01600 

01700  •**»«•••••  yes 

01800 

01900 

Fig.  7.  Run  of  PS.ST2  on  positive  case. 

OOlOO  0  STM  (READY  (ELM  1)  (FIM  4)  (ElM  *>  ML  HLi 

00200  P05  TRUC 

C0300  O  A ''▼(ON- ATTEND 

00400  AT  .  TNOING  -  NPUT  NEXT  STIMULUS  *  (PROSE  •) 

00600  * 

00600  1  ACTION-  (PROSE  8) 

00700  2  STM  ((PROOF  8)  READY  (f  LM  1)  (ElM  4)  (ELM  9)  fit) 

0060C  POT  TRUE 
'X)900  2  ACTION-  (CUD  *•) 

01000  3  STM  , Oir  (II  M  I))  (PROBE  8)  READY  (TIM  4)  (ElM  9)  Ml) 

01100  PD3TRUE 
01200  J.  ACTION-  (OLD  •»> 

01300  4  STM  «0L0  (EtM  4)10  *E  8M01D  (HM  |»  READY  (ELM  4)  Ml) 

01400  P03  TRUE 

0)500  4  ACTON-  (OLD  •») 

01600  5  STM  (TCH  0  (ElM  9)1  (PROSE  8)  (OLD  (ElM  4))  (OtD  (ELM  IM  READY  ML) 

01700  P04  TPUF 

01800  5.  ACTION-  NO) 

01900  6  ACTON-  PrsPC-fO 

02000  7  ACTK'N  (NIC  (RtSPONSl  ANY)) 

02100  8  ACTION  (SAY  ANY) 

02200 

02300  .».»♦••»*»  MD 

02400 

02500 

Fig.  8.  Run  of  PS.ST2  on  negative  case. 


searched  requiring  one  evocation  and  one  action  for  each 
element  examined  that  is  not  the  prroe  and  then  one  more 
(PD2  if  positive,  PD4  if  negative)  to  generate  the  re¬ 
sponse.  However,  as  demonstrated  in  Figures  7  and  8, 
PS.ST2  does  a  self-terminating  search.  It  looks  at  all 
the  elements  in  the  set  in  the  negative  case  (Figure  8), 
but  only  half  the  elements  (on  the  average)  in  the  pos¬ 
itive  case  (Figure  7),  thus  making  the  fuope  of  the  pos¬ 
itive  case  appear  only  half  of  what  It  is  in  the  nega¬ 
tive  case.  But,  as  not<*d  earlier,  the  evidence  is 
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unequivocal  that  the  slopes  ere  the  same  for  the  pos¬ 
itive  and  negative  cases**  Furthermore ,  there  is  no 
serial  position  effect  (ns*  there  would  oe  in  PS.ST2). 
Thus,  we  have  not  yet  found  a  method  for  doing  the 
tank,  that  has  the  right  characteristics, 

PS,  ST 3  and  PS.ST4:  Encoded  Representations 

The  system  of  Figure  9,  PS.ST3,  introduces  the 
notion  that  the  set  is  actually  held  in  sn  encoded 
representation  (i.e. ,  as  a  chunk).  Thus,  we  have 
changed  the  STM  to  hold  some  irrelevant  ^ terns  prior  to 
the  start  of  the  trial.;  At  the  READY  signal  the  encoded 
positive  set  is  brought  into  STM  (PD5) . 

The  positive  set  is  encoded  as  a  nested  set,  as  can 
be  seen  in  the  action  side  of  PD5  for  a  set  of  three 
elements.  A  set  of  five  would  have  the  form: 

(X  (X  (X  (X  X)))).  This  means  that  a  single  production, 
PD2,  can  perform  the  decoding  by  repeated  application. 
The  pol».c  £*  introducing  the  decoding  is  that  the  entire 
set  must  be  decoded  before  any  farther  processing  is 
done  on  it*  Thus,  the  time  to  decode  will  be  indepen¬ 
dent  of  V.*ether  the  result  is  to  be  positive  or  neg¬ 
ative.  Thus,  PS.ST3  satisfies  the  experimental  results 
that  lead  to  the  inference  of  the  exhaustive  search. 

It  does  sc,  however  jy  attributing  the  time,  not  to 
search  (which  U  uune  in  constant  time  by  PD3  fltid  PD4), 
but  to  a  linear  time  to  decode  the  expression.  Figures 
10  and  ll  chow  runs  on  PS.ST3  in  the  positive  and  neg¬ 
ative  case  that  illustrate  thia.  It  c.  n  l>e  se«n  that 
the  time  to  do  the  task  ie: 

T. total  *  2*T.action  +  N*(T.evGke  H  3*T. action) 

Examination  of  PS.ST3  shows  that  what  enforces  th. 
compulsive  decoding  before  testing  is  that  PD2,  tbs 
decoding  production,  occurs  before  PD3  and  PD4,  tne 
comparison  and  response  productions.  Why  don’t  we 
ai-oly  reverse  the  order?  Then  we  should  catch  the 
elements  as  they  are  being  decoded,  and  reinstitote  a 
termination  search.  Figure  12  shows  the  result,  using 
PS.ST3X  which  is  simply  a  reordered  version  of  PS.ST3. 
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OOIOO  tj-  C/Aki 
00200  v;  tVA*» 

00300  i 

rowo  ps  s*3  <mj  rrv  rco  poj  pw  pc«> 

00600  i 

oo«oo  rot  «p«o<if>  am  ((no  <pf  sPONSf » 

00?00  PD?  «M  XI  \?>  WUPfiCSn  -M(X0*»>X2)tl> 

CK*SOO  PD3  KPHd-V  IM,  1')  AND  If  I M  035  >>•-'{* S«W*  VtS) 
00900  Rt<P«  ,1) 

01000  PtM  ("1H  Hr  1  A*.**  <{IM>  -  »  <P£SPONS£  NO)  * FSJWO 
GUCO  P05  <•  5APX  Af*JIST,‘  ARS  ■'* 

0t?00  l$£T  (f|M  {)!*??  ('(.WCHFiW  »») ) 

Oi’HJO  POS  ANY  - 

o;«oo  « 

01600  STM  <JUT  <  ML  »  N*  **  )*.  NO 

0J«00  , 


Fig.  9.  PS.STj:  Nested  representation. 


OOIOO  0  STM:  <A*«  N*  Ml  NA  Ml  ML  NO 

00200  P&6T5JUC 

00300  0  ACTION- AT  TEN) 

00400  Af-XNOCNG  -  **UT  MDTT  fTAAJLUS  •  WAOT 

ooeoo  - 

C0*00  l  ACTION  W'OY 

C0700  2  STM  (READ.  A«K  Ml  MINIMI  Ml) 

cocoa  tow 

00900  7  ACtKN-  (SET  <EIM  <)  (SET  (£IM  4)  (CLM  ♦»' 

01000  3.  STM  WiCT  ItXM  1)  <$£T  (£lM  *)  1CLM  S»|  PfAOY  JINK  Ml  ML  Ml  Nl) 

0H0O  RMSTPtft 
0.700  3  ACT<W  A*T£N3 

0)  JOO  AT Tf  NT>NG  -  INPUT  NEXT  sriMULiS*  IMPOSE  A) 

C1400  * 

01600  4.  ACTION-  iPOOOf  <) 

OifOC  5  STM  KPPOf*  A)  <S£T  (flM  1)  ISET  'flM  4>  (OM  •)))  RCADY  JINK  ML  ML  MU 

01700  P»TM 

OlftfXJ  6.  ACTON- <*»  0  .»» 

01  SVC  b  ACT ON- X? 

OtCOO  7.  AC5»«  XI 

07)00  3.  S’MHt.MIKSn  ElM4)(ElM*»4ao<SfT(aM  »(SCT(CU<4>C£IV  »»» 

02160  xfW  -j)  PfAOt  JL?A  MU 

07700  PD7  TrfUE 

0?<00  £  AC’CN  (f»0«l 

i>?400  9  ACTXVC  x? 

02600  10  ACMON  X) 

0?fc00  II  5TW  KUM  i)ct'M9)(0iD«srT«lVM><E»M9))MPWinE  4MEIM  1> 

0?M*>  I0(  J  I  {» l  M  1>  (SET  <£|  U  4>  J£IM  9H»  PtA43Y> 

07700  P03  mur 

07*00  1)  AC  ION  (Ri  VVYJSf  VTS5 
OTCOO  12  ACT<Y,-f  Slx^iO 
03000  I J  ACTON-  <MC  STOPS  hitrV 
03100  M  AC'W  ISAY7W 
O'TTOO 

03300  ••<•••«•*•  YES 

03400 

03600 


Fig.  J0>  Run  of  PS. ST 3  on  positive  case.. 
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00103  0  $1*4-  (JUNK  NX.  ML  **  NX.  Ml  MU 

00700  POfi  TRUE 

CO 300  0  ACTON-  AfTENO 

00400  ATTENDS  -  MV»  NEXT  STMAUS  #  «AOY 
00500  - 

00400  1  ACTION-  READY 

00700  2  STM-<*tM7Y  JUNKNA  NX  Ml  N4  MU' 

00900  PDb  TRUE 

00900  2  ACTK5N-1SET  (£IM  l)($£T<nM4)(Cl^*W 

OICOO  3  9lMr««t  tfiM  1M«*T  (ELM  4MflM  J)»  RCAfTY  JU^tt  ML**  *l> 

OtlOO  POC  TRiC 

01200  3.  ACTON-  ATTENO 

01300  /*TTrfONG-WPUl  NEXT  STIMULUS 

01400  * 

01500  <  ACTION-  rP*X«J  ,  - 

01400  B.  S1U{(POoaE8)(S£TCCLMlMS€T{ElM4HaMt)H«AOYJUN5C»*LMlT«.> 

01700  P02TRUC 
01*00  ft  ACTION-  <OtO  •*) 

011 00  4  ACTON*  X? 

09000  7.  ACTION-  XI 

02100  4  STM.  i<aw  1)  (SET  (ELM  4)  {ELM  •»  KUO  (SET  (ELM  I)  u*T  (ELM  4J  (ELM  »>»> 

02160  (PHONC  I)  READY  JUNK  WO 

02200  PWTRiC 
02300  ft  ACTON-  (OLD  *»i 
02400  ft  ACTION-  X? 

02900  10  ACTION-  XI 

02*00  II  SlV-<(£lM4)(ElM9)<0l0<sr  (riM4)'ElUSm{PB0eee>(ElMt> 

02*79  <aO  <S£T  (ElM  l»  (SET  IFl  M  1)  (ElM  *)))?  RTAOY) 

02700  P04  TRUE 

02800  11  ACTON- (RESPONSE  N0> 

02100  12  ACT^N-  RESPOND 

03000  13  ACTION- (NTC  iTCSPONSf  ANY') 

03100  »4  ACTION- (SAY  ANT) 

03200 

03300  NQ 

03400 

09500 

Fig.  11.  Run  of  PS.ST3  on  negative  case.: 

Trouble  results,  as  we  see,  since  Pn4  responds  to  the 
non-satisfaction  of  PD3  by  declaring  NO  immediately , 
thus  causing  an  error. 

What  ways  exist  of  patching  up  the  system  90  It 
avoids  the  difficulty  of  Figure  12,  while  preserving 
tha  self-terminating  features?  PD4  must  be  inhibited 
while  decoding  gees  on,  whereas  PD3  must  not  be.  The 
simplest  solution  is  to  split  the  two  productions, 
putting  PD3  ahead  of  PD2  and  PD4  afterward.  This  works 
.just  fine.  Other  alternatives  involve  making  PD4 
Conditional  upon  the  set  being  completely  decoded.  This 
can  :be  doce,  for  instance,  by  changing  PD4  to: 

PC4?  ((PROBE)  AND  (SET)  ABS  —  (RESPONSE  NO)  RESPOND) 

Thus,  although  introducing  the  idea  of  decoding  per¬ 
mitted  us  to  produce  a  version  with  the  correct  timing 


-  2*> 


VISUAL  INFORMATION  PROCESSING 

OOIOO  PS  ST3*  (Rll  PC?  P(»  RM  P05  «*l 
00200  poi  Af>nirt0{»tspi»«v) --‘{OiOMii 

00300  PD3  «P»MT  i*(-T  '>  AM)  (ELM  <K-Jl  '1  ■•'  <wf?POf«£  YtSI  RESPOND) 

00400  P04  (|P«t.S)t)  (WSPOr.tf  NO)RlSPONOi 

00500  P02  ((SET  XJ  X7)  AAO  i HM* )  ->  MAC  #0  X2  K|» 

0M03  PO»!  i»f Airr  AND (SHI  AQS  ■»  (SET  ItlM  11  (SET  (ELM  4)  (RM  #»» 

00700  P06  (ANY  '  AMCNO) 

00800 

OOSOO  PSST3X5TA31- 

01000  0  STIA  UP*  Ml  Ml  NB  Ml  ft  Ml 

OHOO  PD©  1  RUE 

0l?00  0  ACTION  A  Hi  NO 

01300  AfTfNWC.  -  WPU1  NEXT  STA4ULUS  -  RE  AO* 

01400  - 

01500  l  AC  I  ON  READY 

01600  2.  STM.  (HEADY  JUT*  M  Ml  ML  Ml  NA1 

01700  P05TRLC 

31800  2  ACTION  (SET  (ELM  lMSET  (RM  4HEIM  tj» 

01  WO  a.  STM;  ((SET  (ELM  1MSET  (RM  41  (RM  •))>  READY  JUNK  Ml  Ml  Ml  ML) 

02000  P06TRUC 
02100  3.  ACTION*  ATTEND 

02200  ATTENDING  *  WVT  NEXT  STMJtUS  -  (PROSE  4) 

02300  - 

02400  4  ACTION- (PROOE  41 

02500  5.  STM;  M PROBE  4)  (STT  (RM  « (SET  (RM  4)  (ELM  ?)))  READY  Jtf*  Ml  ML  Ml) 

02600  P02TWJE 

02700  5.  ACHON-(OLO*«» 

02800  6  ACTION  X? 

02WO  7  AC1K*.  X! 

03000  1L  STM:«tLM  |>»sn  «eLM4)<RMSH(OLO»SET(ElM  1XSET(RM4)(CLM*))H 

03C50  ITOiftt  4)  READY  AjNK  MU 

03100  P04  TRUE 

00200  8  ACTION- (RESPONSE  NO) 

03300  9  ACT.ON- KSPCVO 

03400  1C  ACTlOff  (NTC  (RESPONSE  ANY)) 

03500  II  ACTION  (SAY  ANY! 

03600 

03700  *»»*«»•»••  NO 

03800 

©3W0 

Fig.  12,  Run  of  PS.ST3X  showing  error. 
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Fig.  13.  PS«ST4j  Linear  representation. 
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properties,  we  have  found  minor  variations  of  the  same 
scheme  that  re-instate  the  terminating  condition,  and 
appear  to  be  somewhat  more  efficient  than  the  exhaus¬ 
tive  scheme. 

Figure  13  shows  an  alternative  fora  of  encoded 
representation  that  appears  to  overcome  these  difficul¬ 
ties.  A  set  is  now  represented  by  a  linear  expression, 
e.g, ,  (SET  A  B  C  D),  Such  sets  cannot  be  decoded  re¬ 
cursively,  but  require  a  set  of  productions,  one  member 
for  each  set  size.  Thus,  PD2  to  PD5  in  PS.ST4  accom¬ 
plish  jointly  the  decoding  of  a  set  in  STM  into  its 
elements.  The  recognition  or  the  larger  sets  occur 
before  smaller  ones,  since  by  the  matching  rules  of  PSG 
the  productions  for  smaller  sets  would  also  be  satis¬ 
fied  by  larger  sets.  The  maximum  site  aet  admitted  in 
PS.ST4  is  four  elements;  it  could  be  extended  to  any 
specific  upper  limit  12 

The  decoding  now  occurs  within  the  action  sequence 
of  a  single  production.  Thus,  it  takes  minimal  time 
(N*T.actlon)  and  there  is  no  opportunity  to  Rlip  in  the 
evocation  of  a  production  (i.e.,  PD6)  that  uould  termin¬ 
ate  the  search.  The  rest  of  PS.ST4  is  the  same  as  in 
PS.ST3.  Figure  14  shows  a  run  on  a  positive  case  that 
Illustrates  how  the  decoding  goes. 

Throughout  the  discussion  we  have  ignored  where 
the  positive  set  came  from.  In  the  first  examples 
(PS.ST1  and  PS.ST2)  we  simply  posited  the  elements  in 
STM  initially.  In  the  later  examples  (PS.ST3  and 
PS.ST4)  we  posited  a  set  In  LTM  already  assimilated  into 
a  production  and  in  the  encoded  form  we  wished  to  work 
with.  We  have  set  to  one  side  the  way  new  productions 
are  created  in  LTM  (i.e.,  the  question  of  LTM  acquisi¬ 
tion  as  it  shows  up  in  our  system) ,  but  the  mechanics 
of  encoding  are  within  our  purview. 

Figure  IS  shows  PS. STS,  which  is  an  augmentation 
of  PS.ST4  to  encode  a  sequence  of  incoming  elements 


I?The  capacity  of  STM  would  appear  to  limit  the  size 
of  the  acta  that  could  be  successfully  decoded;  sc  also 
could  .he  Capacity  cf  the  variable  buffer  store. 


visual  information  processing 


00100  0.  31M=  OW<  ML  Ml  ML  ML  ML  MU 

00200  PM  TRUE 

00300  a  ACTON- ATTEND 

00400  ATTENDING  -  INPUT  «XT  JT&M.U3  •  READY 
00500  - 

00600  I.  ACTION-  READY 

00700  2.  STM:  (READY  JUNK  NL  WE  Ml  Ml  ND 

00*00  POtTRUE 

00900  2.  ACTION-  (SET  (ELAI  1)  (EtM  4)  (£1S4  9» 

01000  3.  STM  ((SET  (ELM  1)  (ELM  4)  (ELM  *W  READY  JUMC  ME  ML  NL  NU 

01100  PD*  TRUE 

01200  3.  ACTION-  ATTEM5 

01 300  ATTEND**)  -  INPUT  NEXT  &TMU-UI  -  <Htj3C  4) 

01400  - 

01500  4  ACTION- (P906C  4)  » 

01600  0  STM  <iWOQE  4)  <3£T  (OX  1)  (OM  4)  (ELM  »»  READY  AM*  ML  ML  MU 

01700  PC3TRUC 

01*00  0.  ACTION- (OLD  w> 

01*00  4  ACTION- X3 
02000  7.  ACTION- X2 
02100  &  ACTON-  « 

02200  f.  STM  (ItlWI  II  (ELM  4)  (£LM  9)  (OLD  (SET  iELM  1)  (ELM  4)  (ELM  >M 

02250  (PRUDE  4)  READY  JUMO 

02300  P04TRUE 

02400  *  ACTION-  (RESPONSE  YS$) 

02600  10  ACTION-  RESPOND 

02*00  1 L  ACTION-  CMC  iWPONSf  ANY)) 

02700  12.  ACTON- ISAY  ANY) 

02000 

02900  MMNMH  VEt 

03000 

03100 

Fig.  14.  Run  of  PS.ST4  on  positive  cafe. 


into  a  sec  with  the  linear  encoding.  Figure  16  shoua 
a  run  where  this  encoding  occurs,  stopping  at  the  point 
where  one  would  go  into  the  rest  of  the  Sternberg  task 
with  a  READY  and  a  (FROBK) .  Again,  there  has  to  ba  a 
separate  production  for  each  set  size,  since  each  item 
of  the  set  has  to  be  acquired  (vith  a  variable)  and 
then  the  new  sec  created.  A  similar  program  can  be 
written  to  construct  sets  in  the  nested  representation. 
In  this  case,  only  a  pair  of  productions  is  needed  (es 
shown  in  Figure  17,  which  gives  only  the  encoding  part 
of  the  complete  system).  This  pair  has  the  property 
that  it  can  construct  indefinitely  large  sets,  chough 
of  course  the  sets  must  still  be  decoded  step  by  step. 

We  have  attended  primarily  to  the  equality  between 
the  slope  of  the  response  time  for  positive  responses 
and  negative  responses,  when  response  time  is  plotted 
against  the  size  of  the  positive  set.  However  the 
negative  response  can  differ  from  the  positive  rasponse 
(Point  6  in  our  Hat  of  empirical  propartlea).  This 
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Kg.  15.  PS. STS:  Linear  representation,  encoding 
and  decoding. 

•fleet  can  be  attributed  to  a  response  bias — that  Is, 
the  subject  sets  himself  to  respond  one  way,  e.g.,  YES 
so  that  the  expected  response  occurs  note  rapidly  than 
the  unexpected  one.  Such  a  bias  could  presumably  be 
adopted  In  either  direction,  which  Is  in  accord  with 
the  empirical  findings.  (For  Instance,  if  there  Is  an 
appreciable  frequency  difference  bstveen  the  occurrences 
of  positive  and  negative  instances,  then  the  response 
is  quicker  to  the  aore  frequent.) 

Given  a  system  such  as  we  have  been  considering, 
we  can  ask  how,  or  whether,  a  response  bias  can  be 
programmed  to  permit  a  more  rapid  response  in  one  or 
the  other  case.  Figure  IS  shows  a  solution,  PS.ST7, 
that  puts  the  (RESPONSE  YES)  element  in  STM  in  advance, 
so  it  dies  not  hays  to  be  done  by  the  positive  response 
production  (PD6) .  We  do  not  shew  what  determines  which 
way  the  bias  goes;  from  the  structure  of  the  production 
system  it  could  be  either  way.  the  actual  else  of  the 
bias  depends  on  ths  difference  between  PD6,  which  now 
alaiply  executes  RESPOND,  and  PD7,  which  has  the  burden 
of  changing  the  response  to  NO.  We  have  shown  three 
different  productions,  FD7A,  FD7B  and  PD7C.  The  first 
doss  not. bother  to  neutralize  (RESPOND  YES),  but  simply 
puts  a  (RESPOND  NO)  ahead  of  it  in  STM.  Presumably 
this  raises  some  problems  about  a  freely  wandering 
(RESPONSE  YES),  but  perhaps  this  could  ba  neutralized 
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H|.  17.  Encoding  production*  for  nested 
representation. 

after  the  response  woe  actually  made.  PD78  and  PD7C 
both  aark  the  YES  respond  OU).  PD7B  does  so  by  locating 
the  raaponie  element  In  Its  condition  part;  PD7C  takes 
an  extra  NIC  action  to  locate  it.  Thus,  ve  have  a  range 
of  tiaa  difference!  depending  on  which  aechanisa  we  : 
opt  for. 
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Fig.  18.  FS.S77:  P$>ST5  with  response  bias. 
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The  final  production  system,  PS.ST7,  comes  close 
to  satisfying  the  several  empirical  propositions  listed 
earlier:  the  linear  dependence  on  set  sice,  the 
equality  of  slope  for  posit ivp  and  negative  eases,  the 
constant  difference  between  positive  and  negative  cases 
and  the  lack  of  a  serial  position  effect. 

However,  the  situation  is  not  perfect.  We  can 
write  the  total  response  time  as: 


T  ■  T. external  +  3*T. evoke 

+  <6  +  X)*T. action  +  N*T, action 


where  X  *  0  for  the  positive  case 

X  *  1,  2,  3  for  the  negative  case 
for  PD7A,  B*  C  respectively. 


Actually,  this  equation  contains  a  small  addition,  to 
the  constant  part.  If  the  system  is  actually  run 
through  both  the  encoding  and  decoding  stages  then 
(RESPONSE!  gets  lost  from  STM  before  it  is  called  by 
(PROBE)  after  decoding.  This  can  be  avoided  by  the 
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addition  of  another  production  that  brings  (RESPONSE) 
to  the  front  when  (PROBE)  is  first  detected: 

TDX:  (READY  AND  (PROBE)  AND  (RESPONSE)  — >  (OLD  **)) 

This  production  goes  right  after  PD1.  It  Barks  READY 
as  old  to  avoid  repetition  of  PDX  itself;  READY. has  in 
fact  done  lte  job  «?r controlling  the  encoding  and  'ini¬ 
tiating  the  response  when  (PROBE)  occurs.  PDX  adds  ona 
T. evoke  and  one  T. action  to  the  constant  pert  of-  T 
above, 'since'  It  is  evoked  on  every  occasion < 

-The  experimental  value  of  >•  slope  of  tine 
against  set  size  is  around  35  as.  Bence  froa  the ' 
equation  above,  T. action  auat  be  around  35  ns.  The 
difference  between  positive  end  negative- cases  is 
either  1,  2,  or  3  tines  T. action,  which  is  to  say, 
either  about  35,  70,  or  105  ns.  Empirically  this 
difference  is  often  found  to  be  around  50  ns,  which 
lies  halfway  between  the  two  values  for  A  and  B. 

Notice  that  both  the  slope  and  the  positive-negative 
differ cnee  are  determined  solely  by  T. action.  T. evoke 
enters  the  equation  only  as  part  of  the  total  ordinate, 
since  this  also  contains  various  peripheral  perception 
and  notor  response  tines  (here  symbolized  by  T. external), 
there  Is  no  way  to  derive  any  independent  infornation 
about  T. evoke.  The  beat  we  can  do  is  make  a  check  of 
reasonableness.., Since  the  total  ordlnata  is  around 
350  ns,  there  is  about  140  as  available  for  T. external  + 
3*T.evoke,  which  does  not  seen  out  of  bounds  if  T. evoke 
is  not  too  large. 

There  is  little  point  in  atteapting  to  assay  the 
seriousness  of  the  discrepancy  between  the  theoretical 
and  eapirlcal  values  for  the  positive-negative  difference 
or  to  explore  various  potential  explanations.  The  model 
is  still  enough  within  the  ball  park  to  raisin  worth 
considering.  Other  aore  pressing  Issues  need  exposing. 

Let  us  note  what  the  control  structure  has  accom¬ 
plished  for  us  so  far.  First,  we  have  been  able  to 
approach  the  task  of  binary  classification  In  the 
Sternberg  paradlga  as  a  programming  task.  We  could 
tell  vhen  an  arrangement  accomplished  the  task  and  vhtn 
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It  did  pot.13  Cnee  a  viable  production  system  was 
discovered,  all  of  its  properties  were  fixed,  to  the 
extent  that  we  had  settled  on  an  explicit  timing  model. 
Thus,  explicit  predictions  follow  for  the  entire  range 
of  inputs. 

In  this  view  PSG  represents  the  basic  structure 
of  the  human  information  processing  system.  It  follows 
thatam,  program  written  in  PSG  should  be  a  viable 
program  for  the  subject.  Only  such  an  assumption  per¬ 
mits  us  simply  to  program  the  task  in  PSG.  However, 
nothing  has  been. provided  to  determine  which  of  all  the 
feasible  production  systems  will  come  to  govern  the 
subject's  behavior.  Cur  example  makes  clear  that  the 
multiple  production  systems  are  possible.  Without  a 
theory  of  which  system  is  selected  the  total  view 
remains  essentially  Incomplete. 

General  considerations  of  the  adaptiveness  of 
human  behavior  lead  one  to  adopt  the  following: 

Pnnaiple  of  adaptations  Other  things 
equal,  the  subject  will  adopt  that 
production  system  that  more  closely 
obtains  his  goals. 

It  la,  after  all,  a  principle  of  this  sort  that  leads 
us  to  believe  that  the  subject  will  come  to  perform  the 
task  at  all,  once  instructed.  For  we  do  not  believe 
that  the  subject  cooes  equipped  with  a  preformed  organ¬ 
ization  for  doing  the  Sternberg  task  (before  encounter¬ 
ing  it  for  the  first  time).  This  organization  is  com¬ 
posed  in  response  to  the  demon  is  of  the  task,  i  e. ,  the 
subject  himself  selects  this  organization,  presumably 
from  among  others  that  he  could  adopt  that  would  not 
solve  the  task.  That  he  should  also  be  able,  say,  to 
use  one  organization  that  rakes  less  time  than  another 
is  simply  another  application  of  the  same  principle. 

Why  then  does  not  e  subject  use  the  more  efficient 


lHfe  do  face  verifying  that  the  program  does  in  fact 
work,  i.e. ,  debugging  the  program.  While  simple  for 
the  task  at  hand,  it  can  become  a  serious  problem. 


VISUAL  INFORMATION  PROCESSING 


schemes,  such  as  FS.ST1  which  recognizes  the  action  In 
i  tine  Independent  of  sec  size  end  (Importantly)  lees 
than  lor.  the  other  systems?  Resolution  can  be  sought 
in  several  dlrvctions .  Possibly  the  timing  model  is 
wrong,  or  the  particular  structure  of  PSG,  or  the 
general  structure  of  production  systems.  A  different 
sort  of  possibility  is  that  additional  constraints 
exist  that  limit  the  production  systems,  that  are  pos¬ 
sible  or  selected.  For  example,  if  the  subject. can't 
learn  a  given  type  of  production  system  or  assemble  it 
on  demand,  then  it  can  be  excluded  from  the  feasible 
set.  Something  of  this  sort,  p»rnape,  makes  us  hesi¬ 
tate  at  splitting  the  response  productions  on  both 
sides  of  the  decoding  productions  in  PS.ST3  (Figure  13) 
He  have  reason  to  be  leery  of  the  linear  ordering  of 
productions,  since  we  do  not  interpret  a  production 
syatem  as  considering  productions  serially,  but  rather 
in  parallel.  If  productions  are  not  completely  inde¬ 
pendent,  but  aie  developed  in  subsystems,  arbitrary 
ordering  may  not  be  possible. 

Notice  that  the  set  of  all  production  systems 
plays  a  somewhat  different  role  here  than  does,  say, 
the  set  of  all  Harkov  processes  in  mathematical  learn¬ 
ing  theory.  In  both  cases  the  set  in  question  is 
indeed  the  set  of  all  theories  under  consideration. 

But  with  the  Harkov  process  the  problem  of  selection 
is  one  of  descriptive  adequacy  (i.e.,  of  the  fit  to  the 
data).  Xn  the  present  case,  since  the  se'ection  is 
ascribed  to  the  subject  (by  a  not  yet  formulated  pro¬ 
cess,  unfortunately)  we  must  confront  the  issun  of  why 
psychologically  one  rather  than  anothei  production 
system  occurs— in  addition  to  the  question' of  whether 
it  fits  the  data. 

Leaving  to  one  side  for  the  moment  the  major  issue 
just  raised,  working  with  the  production  systems  has  in 
fact  led  us  down  a  somewhat  new  path  in  theorizing 
about  the  basic  phenomena  in  the  Sternberg  paradigm. 

The  basic  linear  effect  is  ascribed  not  to  a  search 
process  but  to  a  decoding  process.  This  solution  was 
discovered  In  the  attempt  to  find  a  production  system 
that  fit  the  oasic  phenomena.  One  can  find  in  the 
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literature  some  suggestions  that  encoding  may  be 
Involved  (e.g,,  Sternberg,  1970),  but  no  genuine 
presentatlo  f  auch  a  theory  la  known  to  me.  This 
at  least  IIIl.  .  rates  that  the  additional  level  of 
detail  of  a  control  system  theory  serves  to  generate 
new  hypotheses  about  the  mechanisms  involved. 

This  assumption  about  decoding  le  sufficiently 
hovel  and  sufficiently  central  to  the  model,  that  it 
rates  additional  investigation.  This  will  let  us  ex¬ 
plore  additional  aspects  of  what  a  detailed  theory  of 
control  can  provide. 

The  Decoding  Hypothesis 

He  wish  to  explore  the  decoding  hypothesis  and 
attempt  to  discover  whether  it  is  reasonable  nr 
whether  (as  introduced)  it  is  to  be  viewed  as  »  feus 
ex  naaktna  to  permit  the  construction  of  a  prt  'notion 
system  that  happens  to  fit  the  empirical  data,  "here 
are  two  directions  (at  least)  in  which  to  look.  ’irst, 
we  can  search  for  basic  theoretical  reasons  why  the 
decoding  should  exist.  Second,  we  can  look  at  other 
tasks  to  see  whether  they  too  seen  to  require  the 
decoding  hypothesis. 

Why  V'ccde? 

t 

The  argument  stares  from  the  generally  accepted 
view  (within  an  information  processing  theory  of  human 
behavior)  that  subjects  encode  stimuli  ubiquitously. 
Hence,  the  argument  goes,  the  system  is  simply  unable 
to  pick  a  production  system  Chat  does  not  do  the  encod¬ 
ing,  hence  the  decoding. 

The  argument  has  perhaps  soma  force,  though  it  is 
better  when  kept  r-rber  general.  In  detail,  it  would 
not  seem  to  rule  out  the  decoding  of  the  eet  upon  re¬ 
ceipt  of  the  ready  signal,  rather  than  the  probe,  so 
that  by  the  tin"  the  probe  came  along  only  the  instan¬ 
taneous  matching  productions  would  need  to  be  evoked. 
This  would  not  be  possible  in  the  dynamic  versions  of 
the  task  where  the  set  is  given  sequentially  right  up 
to  the  problem.  But  we  know  that  the  behavior  in  the 
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static  task  (the  positive  set  in  LTM)  and  the  dynamic 
task  (the  positive  set  given  each  time)  are  essentially 
the  same.  Thus  we  must  still  face  the  issue:  Why  not 
decode  the  positive  set  into  STM  at  the  ready  signal? 

Let  us  return  to  the  question  of  adaptive  behavior 
raised  in  the  prior  section  in  a  more  pointed  way:  Why 
should  the  subject  encode  and  decode  a  set  rather  than 
leave  it  in  STM  where  the  task  can  be  performed  in  a 
single  recognition  (as  in  PS.STl)?  Consider  the  follow¬ 
ing  assumption: 

Assumption  of  Unreliable  STMt  The 
contents  of  STM  are  sufficiently 
variable,  noisy  and  unreliable  that 
the  subject  will  adopt  production 
systems  with  lower  risk  from  STM 
unreliability. 

Unreliability  of  STM  could  be  the  case  because  it  fades 
rapidly  or  because  it  is  the  confluence  of  uncontrolled 
input  from  many  sources,  both  from  LTM  and  from  percep¬ 
tion.  The  production  system  itself  is  consonant  with 
such  a  view.  Imagine,  as  argued  earlier,  that  the 
small  production  system  that  we  use  to  describe  the 
program  of  the  subject  is  really  embedded  in  a  very 
large  system.  From  time  to  time  other  productions  may 
be  evoked  Instead  of  the  ones  in  our  set.'  The  only 
effect  of  these,  mostly,  may  be  to  add  Junk  to  the 
memory  and  to  add  some  time  to  performance  (a  few 
X. evokes  and  T. actions).  From  a  control  point  of  view 
the  process  looks  like  cycle-stealing  (as  it  goes  on 
in  most  computers  today  for  input /output) .  From  a  data 
point  of  view  it  makes  the  STM  unreliable. 

Given  such  a  situation  the  rational  way  to  obtain 
reliable  behavior  is  to  work  with  programs  that  are  as 
safe  as  possible— in  which  the  parts  of  the  program  are 
positively  coupled.  In  the  case  at  hand,  if  the  total 
organization  (our  PS.ST7)  both  dumps  the  elements  Into 
STM  and  then  tests  for  a  match,  then  the  test  production 
can  operate  with  the  knowledge  that  the  elements  of  the 
set  are  all  there.  It  is  a  reliable  method  for  solving 
the  problem.  If  the  system  (PS.STl)  simply  scans 
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whatever  Is  in  STM  at  the  probe  signal  when  the  set 
was  dumped  earlier  at  the  ready  signal,  then  it  is  noc 
safe.  The  chance  of  a  spurious  NO  is  appreciable  and 
even  the  chance  of  a  spurious  YES  Increases.  What  if 
the  subject  thinks  about  some  possible  element  during 
the  interval  between  READY  and  PROBE — he  has  no  way  of 
guaranteeing  that  he  will  be  able  to  distinguish  it 
from  a  true  element.  Note  that  he  cannot  process  such 
a  stray  thought,  since  processing  conflicts  with  being 
prepared  to  react  to  the  PROBE  when  it  comes. 

This  argument  ’ssentially  introduces  a  second 
criterion,  reliability,  in  addition  to  speed  as  a 
governor  of  the  production  system  that  the  subject  will 
construct.  We  have  thereby  preserved  the  principle  of 
adaptation.  Against  this  we  have  only  a  qualitative 
notion  so  far  of  how  to  assess  the  reliability  (as  seen 
by  the  subject)  of  a  proposed  production  system.  In 
the  case  at  hand,  an  ad  hoc  argument  goes  some  ways 
toward  establishing  that  the  speedier  production  system 
is  less  reliable  than  the  slower  one  (which  is  also  the 
empirically  correct  one)..  We  should  at  least  package 
this  assumption  in  a  principle: 

Principle  of  Coupled  Sy stems:  When 
attempting  tc  behave  reliably  the 
subject  uses  production  systems 
where  early  evoked  productions 
produce  guarantees  on  Che  contents 
of  STM  that  can  be  utilized  by 
later  productions  (thereby  coupiii  g 
the  productions  together). 

The  argument  above  leads  directly  to  two  quali¬ 
tative  hypotheses,  one  rather  easy  to  verify,  another 
much  harder.  First,  if  the  selection  of  PS.ST7  over 
PS.ST1  is  due  to  a  requirement  for  reliability,  then 
releasing  that  requirement  sh^ul-  move  subjects  to 
adopt  PS.ST1.  As  oentioied  at  the  beginning  of  the 
paper,  the  conditions  for  the  Sternberg  paradigm  are  a 
low  error  rate  (of  the  order  of  a  few  percent).  If  one 
permitted  much  higher  error  raves  and  paid  off  for  speed 
only,  one  should  see  the  slope  disappear.  It  is  unknown 
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of  course,  how  much  the  error  rate  would  go  up,  since 
selection  of  the  reliable  system  Is  based  on  a  choice 
of  the  subject  in  the  face  of  a  task  demand,  not  on 
demonstrated  failure  of  tha  faster  algorithm.  This 
experiment  should  be  rather  easy  to  carry  out  and  indeed 
the  essential  facts  may  already  be  known  (though  1  don't 
k-ow  them). - - - - 

She  second  hypothesis  comes  from  noting  that  ve 
hays  an  instance  of  the  speed-accuracy  trade-off,  which 
is  s' general  phenomenon  much  studied  in  the  literature. 
One  of  that  features  of  that  literature  (which  ve  cannot 
raviow  here)  is  that  no  mechanisms  are  proposed  as  to 
how.  a  speed-accuracy  trade-off  Is  possible.  One  often 
proposes  to  represent  such  s  trade-off  by  a  criterion 
parameter  which,  can  be  changed.  But  (to  my  knowledge) 
this  never  is  embedded  within  a  model  for  hov  such  a 
parameter  effects  a  shift  to  greater  speed  ac  the 
expense  of  accuracy  or  vice  vsrsa.  The  hypothesis  than 
is  that  the  space  of  feasible  programs  is  indsed  rela¬ 
tively  large  and  that  selection  (construction)  of  dif¬ 
ferent  production  systems  with  slightly  different  speeds 
and  reliabilities  provides  the  underlying  ability  of  the 
Subject  to  trade  off  speed  for  accuracy.  Within  this 
hypothesis,  the  freedom  of  programmability  of  production 
systems  far  from  being  a  disturbing  theoretical  feature 
(reflecting  a  preference  that  a  unique  production  system 
exist  for  a  task);  ia  an  essential  aspect  of  the  human 
information  processing  system. 

We  stats  these  two  hypotheses  to  point  out  hov 
having  a  specific  theory  of  the  control  system  is  able 
to  generate  hypotheses  of  the  tathsr  global  nature  long 
favored  by  experimental  psychology. 

Memory  Span 

A  major  advantage  of  a  theory  of  the  control  system 
is  the  applicability  of  the  theory  to  a  wide  range  of 
tasks.  One  should  be  able  to  test  an  hypothesis,  such 
ss  the  decoding  hypothesis,  against  its  indicated  use 
in  other  tasks,  A  particularly  transparent  task  from 
this  viewpoint  is  the  sesndard  auditory  memory  span 
test. 


38  - 


VISUAL  INFORMATION  PROCESSING 

We  can  Cake  toe  cask  as  receiving  a  sequence  of 
elements,  each  of  which  can  be  perceived  as  a  chunk. 

When  Che  signal  to  repeat  occurs,  the  subject  is  to 
repeat  the  sequence  exactly. 

Figure  19  gives  a  production  system  FS.MS1,  for 
performing  Che  memory  span  test  in  the  most  obvious 
way.  The  subject  lets  the  elements  accumulate  in  STti 
and  Chen,  upon  REPEAT,  proceeds  to  respond  wich  each 
one.  It  keeps  from  repeating  an  element  by  marking 
each  element  used.  Thus,  we  get  a  production  system 
of  only  three  productions:  PD1  to  emit  the  response 
and  mark  old;  PD2  to  terminate  the  trial  by  deactivat¬ 
ing  REPEAT  when  no  more  elements  are  left;  and  PD3  to 
attend  to  the  environment,  t'e  do  not  include  an  ini¬ 
tial  ready  signal  in  this  simple  version. 

Figure  20  gives  a  run  of  PS. MSI  on  a  sequence  of 
three  elements.  We  have  modified  Che  executive  struc¬ 
ture  so  that  the  ATTEND  operator  goes  to  a  list,  * 
STIMULUS  (given  at  the  Cop  of  the  figure)  ,  and  attends 
to  each  symbol  successively.  Although  all  members  of 
the  sequence  are  emitted,  the  system  does  not  obtain 
them  in  the  correct  order.  A  moment's  consideration 
shows  that  this  is  not  a  fluke.  The  STM  is  indeed  a 
•tack-like  memory  which  performs  generally  in  a  last- 
in  first-out  manner. 

Bow  can  this  order  be  reversed/  There  are  two 
directions  to  explore;  reversing  at  response  time;  and 
reversing  at  input  time  so  that  the  response  process 
works  off  -.omething  in  the  right  ord  r.  Let's  consider 
each  in  turn. 

Simply  tring  to  pick  up  the  last  element  of  a 
given  type  in  STM  appears  difficult.  The  subject 
(i.e. ,  the  production  system)  knows  about  the  elements 
only  that  they  belong  to  the  same  type  (e.g.,  are  marked 
ELM).  The  nature  of  the  match  is  such  that  Che  more 
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Fig.  19.  PS.VS1:  Simple  PS  for  memory  span. 
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Fig.  20.  Run  of  PSJISl, 
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rccA'it  cleafeitts  vi.ll  be  selected  first.  Thus,  the  only 
v:!;J  to. git  the  last  clement  Is  by  brute  forte— by  pro¬ 
ductions  that  ia'.cn  onto  ail  preceding  elements,  One 
needs  a  net  of  productions  of  the  fore: 

XI  ASM  X?  AMO  X3  AMO  Xi  ... 

'  XI  ASiS  X2  AMD  X3  — ■»  ... 

XI  ANC  X2— >  ,,, 

Xi  —>  .... 

Bhile  this  bears  aua-j  resemblance  to  the  encoding  pro¬ 
ductions,  it  still  seems  like  sn  uncomfortable  way  to 
do  business..  •" 

An  oltereariyc  strategy  is  to  nark  each  element 
os  It  sr.ters  in  a  unique  way  so  that  that  production 
system  cun  know  about^he.  first  one..  This  essentially 
produces  an  3TH  poifnd-asaoclijite.  structure,'  e.g. , 

sw.f  C.ua3.e?  .,.  <ga$2  « 

tfith  this  art*  gertvnt  the  response- prodocsioia.  bcv«  to 
be  sn  explicit  -i  at,  knowing  first  to  respond  vtth  -■■  . 
(HtMl) ,  then  vlt.c  (EIM2)  ,  etc. "  Again,  it  seems  a 
possible,  but  awktsrd  strategy.  However,  sn  ittc*pt 
on  the  part  of  a  subject  to  use  the  1-BbK.  S-Slftiti 
mnemonic  on  the  meoo.-y  span  teat  would  be  an  application 
of  this.  (General  ex,  erienc  i  is  that  p  re  sent*  c ion  rs  tee 
of  1  aymbol/sec  are  to*,  fast  for  this.)  '-.f- 

As  a  final  example  of  the  reverse-while- responding 
strategy,  the  system  could  respond  internally  as  in 
Figure  20,  which  reverses  the  erder,  and  then  respond 
again  externally,  thus  emitting  them  in  the  right  order. 
This  is  also  a  conceivable  strategy  and  in  slightly 
different  circumstances  can  be  detested  (e.g.,  in  recit¬ 
ing  an  alphabet  backwards,  McLean  A  Gregg,  1967).  It 
seems  sn  unlikely  strategy  in  the  simple  memory  span. 

It  should  produce  a  substantial  delay  before  the  first 
response;  further,  she  task  of  repeating  the  set  back¬ 
wards  should  be  easier  than  repeating  it  forwards  and 
should  not  have  the  delay.  Empirically  these  seea  not 
to  be  the  case. 
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Turning  to  strategies  of  reversing  on  input,  the 
attempt  to  do  this  for  each  element  at  each  moment  of 
input _<r«£tc»  a  fair,  amount  of  thrashing,  in  which  the 
set  of  already  ordered  elements  must  be  brought  in 
front  of  each  new  element  end  still  left  in  the  same 
order. 

A  second  scheme  is  to  encode  the  elements  on 
input,  just  as  ve  have  done  for  the  Sternberg  task. 

This  leaves  a  single  chunk  in  STM  which  is  decoded  in 
the  right  order  at  response  time.  Figure  21  gives  a 
production  system,  PS.MS2,  for  this  encoding.  To  show 
the  relationship  to  the  Sternberg  task  we  have  labeled 
the  productions  with  the  ones  they  correspond  to  in 
PS  ST?  (Figure  18) ,  the  final  production  system  for 
the  Sternberg  task.  Productions  PD1  and  PD1.1  are  the 
response  productions  and  are  unique  to  the  task.  Pro¬ 
duction  PD1.1  ie  the  response  production  for  the  mem¬ 
ory  Span  task,  and  takes  the  place  of  PD6  and  PD7  in 
the  Sternberg  task.  FD12  in  the  Sternberg  task  sets 
the  response  bias.  This  is  not  s  feature  of  the  memory 
span  task,  so  it  is  missing  as  well.  Corresponding 
productions  are  not  ell  Identical.  The  encoding 
productions  (PD8  -  Phil)  are  trie  same.  However,  the 
decoding  productions  (?D2  -  PDS)  are  responsive  to 
REPEAT  rather  than  to  (PROBE).  To  make  them  identical 
would  require  another  level  of  indirectness — one  that 
might  be  expected  perhaps  in  the  early  stages  of  per¬ 
formance  (When  the  subject,  in  effect,  must  interpret 
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Fig.  21.  PS.MS2:  PS  for  iwuory  span,  vith  encoding. 
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the  signal  In  terms  of  a  common  meaning— to  decode) , 
but  would  presumably  be  adapted  out  vlth  practice^ 
Finally,  PD1,  which  recognises  Che  end  of  the  task,  is 
responsive  to  different  features  in  the  two  tasks.. 

Figure  22  shows  a  run  of  PS.HS2  on  a  three  element 
sequence,  which  can  be  seen  to  perform  appropriately. 

Let  us  summarize.  Substantively,  we  have  found 
that  the  encoding  hypothesis  is  not  only  consistent 
with  behavior  in  another  distinct  task,  but  provides 
an'  appropriate  -solution  to  a  difficulty  (the  ordering) 
that  arises  from  Che  application  of  a  naive  formulation. 
We  showed,  however,  that  it  was  not  the  oftly  way  to 
overcome  the  difficulty..  Some  of  the  alternatives, 
despite  our  disparagement,  clearly  represent  alterna¬ 
tives  to  be  considered  further.  We  indicated  some 
other  tasks  in  which  they  appear  to  operate.  Never-  . 
theless,  the  encoding  hypothesis  comes  through  appear¬ 
ing  subatantially  lea*  ad  hoo. 

■  ..  Methodologically, -we  say  that  it  was  relatively 
cauyt  to  move  to  a  new  task  and  to  construct  a  theory 
that  had  substantial  contact  with  the  initial  one. 

With  a  little  care  one  could  Insist  that  exactly  the 
same  theory  (i.e.,  the  same  total  production  system) 
be  abla  to  perform  both  tasks.  To  be  sure,  some  of 
the  productions  will  be  unique  to  each  task.  Indeed, 
they  must  bo  if  the  unique  aspects  of  a  task  ere  to 
be  represented. 

In  seeking  support  for  the  decoding  hypothesis  in 
the  phenomenon  of  response  order  we  have  taken  the 
structure  of  the  STH  to  be  fixed.  As  we  observed 
earlier,  it  is  the  last-in  firat-out  character  of  the 
STH  that  creates  this  problem  and  mokes  it  a  fundamental 
one.  Alternatively,  the  solution  might  lie  in  changing 
the  structure  of  the  underlying  system.  One  can  cer¬ 
tainly  construct  STM  models  that  have  a  first-in  first- 
out  character  and  thus  make  the  response  order  identical 
to  Input  order.  However,  such  systems  must  ultimately 
have  other  problem*.  For  the  underlying  empirical  real¬ 
ity  la  Chat  humans  appear  to  behave  in  positive  tine 
order  (first-in  first-out)  in  the  short  run  and  in 
inverse  time  order  (last-in  first-out)  in  the  long  run. 
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Fig.  22.  Sun  of  PS.HS2. 
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Thus,  there  Is  a  reversal  at  sense  stage  (from  primacy 
to  recency,  if  you  like  to  think  of  it  that  uay)  and 
the"  Structure  of  the  system  oust  account  for  both 
aspects- 


Applications  of  the.  tbsery 

We  have  now  developed  a  theory  of  the  simple 
Sternberg  binary  classification  task  that  has  nodest 
standing.  It  should  be  possible  to  apply  it  to  the 
experiments  discussed  in  this  ./agios  iua  that  make  use 
of  similar  task  situations*  To  do  this  properly  re¬ 
quires  that  we  extend  the  theory  to  these  variant 
situations,  catch  arwa  did  to  ths  memory  span  task* 
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keeping  as  much  communallty  with  the  original  situation 
as  possible.  However,  there  is  a  limit  to  an  intro¬ 
ductory  paper  and  to  go  into  the  results  of  Posner 
(Chapter  2)  and  Hayes  (Chapter  4)  in  detail  exceeds 
those  limits.  Thus,  we  must  be  content  with  a  cursory 
examination  of  a  few  aspects.  Methodologically,  we 
can  make  a  virtue  of  this  restriction,  since  it  pro¬ 
vides  the  opportunity  to  apply  the  theory  in  a  qual¬ 
itative  way,  thereby  illustrating  how  such  applications 
might  go. 

Perceptual  Knhancement 

The  brief  discussion  in  Posner's  paper  on  the 
phenomenological  experience  of  perceptual  enhancement 
of  the  successful  item  in  a  Nelsser  paradigm  offers  a 
simple  example.  He  observes  that  Cavanagh  and  Chase 
(1971)  found  that  in  a  Sternberg  task  with  two  probes 
(one  positive,  one  negative)  the  positive  one  only  was 
enhanced.  Posner's  argument  was  that—this  controverted 
the  use  of  the  enhancement  as  an  indicator  of  the 
boundary  between  pro-attentive  and  attentive  processes, 
since  much  attentive  processing  (i.e.,  the  search)  went 
on  prior  to  the  enhancement  and  did  so  for  both  probes. 

The  present  model  offers  a  somewhat  different 
characterization.  Presenting  two  probes  rather  than 
one  has  no  effect  on  the  linear-time  component,  which 
is  the  decoding  time.  It  might  have  an  effect  on  the 
intercept  if  the  two  probe9  arc  themselves  encoded  in 
some  way,  or  enter  STM  serially.  One  and  only  one  of 
the  probes  evokes  the  positive  production  (PD6) .  The 
other  probe  simply  does  not  evoke  anything.  Thus  a 
single  decoding  operates  for  both  probes,  14 

Examination  of  the  production  system  puts  the 


14The  actual  slopes  are  somewhat  higher  than  the 
usual  35  ms.  This  complicates  the  interpretation.  It 
suggests  (as  only  one  among  several  alternatives)  that 
some  subjects  may  have  processed  each  probe  separately 
and  that  the  data  represent  a  mixture  of  methods. 
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enhancement  effect  on  Pl)b,  which  is  to  say  on  the 
multiple  occurrence  of  a  variable  in  the  matching. 

This  offers  a  clue  about  how  one  might  explore  the 
details  of  the  match  processes.  However,  the  present 
model  doe9  not  offer  a  clear  interpretation  of  pre- 
attentive  versus  attentive  processes.  First  of  all, 
the  model  does  not  Include  a  perceptual  component  so 
that  one  can  determine  whether  the  match  is  or  is  not 
part  of  the  same  apparatus  that  carries  out  perception. 
No  matter  how  one  determines  the  latter  question,  the 
match  (the  selection  of  the  next  production),  and  hence 
the  enhancement,  is  involved  intimately  with  whatever 
can  be  called  attentive  processes. 15 

Having  gone  this  far,  it  is  tempting  to  state  a 
hypothesis  about  the  locus  of  conscious  experience. 

It  is  not  to  be  associated  with  the  content  of  any 
memory,  not  even  of  STM  whi''\  defines~In  an  operational 
sense  what  the  subject  is  momentarily  aware  of,  l.e., 
to  what  he  can  respond  to  in  the  next  tens  of  milli¬ 
seconds.  Rather,  phenomenal  consciousness  is  to  ba 
associated  with  the  act  of  matching,  and  its  content 
Is  given  by  the  sec  of  STM  items  extracted  by  the 
matched  condition.  Thus,  it  is  an  ephemeral  fleeting 
thing  that  never  stays  quite  put  and  never  seems  to 
have  clearly  defined  edgc9  (the  never-step-into-the- 
same-river-twlce  phenomenon).  It  seems  like  an  inter¬ 
esting  hypothesis.  That  the  hypothesis  can  be  stated 
in  such  a  precise  form  is  attributable  to  having  a 
detailed  model  of  the  control  structure. 

Recency  Effects 

Posner's  paper  discusses  several  Sternberg- like 
tasks  in  detail.  A  prominent  feature  of  his  data  is 


15The  diffusenesa  of  this  discussion  only  shows  that 
each  theory  puts  its  own  classification  on  phenomena 
and  one  cannot  easily  discuss  one  in  terms  of  the  other 
(attentive  versus  pre-attentive  derive  from  a  certain 
rough  model  of  the  total  machinery) . 
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tht  non-linear  relation  to  positive  set  size.  This 
leads  him  to  plot  all  of  his  graphs  against  the 
logarithm  of  sel  size,  since  this  tends  to  linearize 
the  curves  somewhat.  This  decision  of  how  to  display 
the  data  makes  me  uncomfortable,  X  confess,  since  it 
seems  not  to  be  theoretically  motivated.  In  fact  it 
serves  to  obscure,  rather  than  clarify  the  explanation 
Posner  provides  in  passing.  He  notes  that  the  effect 
■ay  be  a  recency  efface  on  the  first  Item,  namely,  that 
subjects  respond  more  quickly  to  sets  of  size  one  than 
to  larger  sets.  If  this  Is  so,  then  the  curves  should 
fas  linear  for  set  sizes  greater  than  one.  However,  all 
the  data  are  limited  to  three  sizes,  1,  2  and  4,  and 
thua  no  direct  empirical  test  of  this  is  possible. 

This  recency  phenomenon  appears  to  be  not  unknown 
elsewhere  in  the  literature  on  Che  Sternberg  task  and 
seems  to  be  associated  with -dynamic  presentation — 
defining  the  set  just  prior  to  test— with  a  relatively 
short  delay  between  set  definition  and  -probe,  Posner's 
experiments  fit  this  format,  since  they  run  from  set 
to  probe  continuously  (at  half  second  pacing)  and 
without  warning. 

An  explanation  is  not  far  to  seek  within  the 
present  theory,  consisting  of  both  the  production 
system  framework  and  the  decoding  hypothesis.;  With 
set  size  of  one  the  system  delays  encoding  until  the 
second  element  arrives.  If  Instead  the  probe  arrives, 
then  there  is  no  decoding  step;  rather,  the  system 
simply  responds.  In  fact,  if  one  runs  the  full  range 
of  aet  sizes  one  finds  the  recency  effect.  From  the 
formule  given  earlier,  which  expresses  the  correct 
linear  growth,16  one  gets: 

T(l)  «  3*T. evoke  +  6*T. action  +  1*T. action 
»  3*T. evoke  +  7*T.  action 


l6In  deriving  that  formula  we  simply  did  not  reflect 
the  special  circumstances  of  the  special  case.  A  care¬ 
ful  enough  analysis  would  have  revealed  it,  of  course, 
and  perhaps  the  perspicacious  reader  in  deriving  it 
independently  detected  the  flaw. 
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The  measured  value  is; 

T(l)'  »  2*T. evoke  +  5*T. action 

This  provides  a  difference  of  T. evoke  +  2*T. action, 
which  is  something  in  excess  of  70  ms,  taking  the  35  ms 
figure  for  T. action.  This  is  somewhat  high  for  the 
measured  values,  which  run  40  -  60  as .  As  with  the 
discrepancy  on  the  response  bias,  we  do  not  know  whether 
of  not  to  be  disturbed  by  the  approximate  fit.  Basi¬ 
cally,  the  ambiguity  of  Interpretation  arises  because 
the  experimental  numbers  are  averages  over  trials  and 
over  subjects.  This  means  ney  are  undoubtedly  gener¬ 
ated  by  mixtures  of  strategies  to  some  unknown  extent. 

Posner's  Figure  2  shows  a  strong  serial  position 
effect  for  a  set  size  of  four.  This  is  a  recency  effect 
in  which  the  last  item  {the  fourth)  is  processed  about 
50  ms  faster  than  the  other  three,  which  are  reasonably 
constant.'  Our  theory  as  it  stands  does  not  handle  this, 
since  it  produces  the  recency  phenomenon  only  for  setn 
of  one.  Ue  can  extend  it  to  the  new  situation,  however, 
if  we  assume  that  the  subject  can  react  to  the  last 
element  directly,  even  though  he  has  also  encoded  it. 

The  size  of  the  effect  indicates  that  this  happens  some¬ 
times,  but  not  always,  so  that  the  data  would  be  a 
mixture  of  two  ways  of  doing  the  task.  If  this  is  the 
explanation,  we  should  also  find  recency  effects  for 
the  ocher  set  sizes. 

In  general  teems,  such  ,  explanation  is  consistent 
with  the  nature  of  production  systems.  There  is  no 
reason  why  the  responding  production  (PD6)  should  not 
pick  up  the  data  of  the  unencoded  element  directly. 

In  fact  the  ability  to  short  circuit  a  longer  process 
and  to  mix  methods  would  9eem  to  be  a  major  point  in 
favor,  of  production  systems,  providing  a  detailed 
explanation  for  variety  and  lability  of  behavior. 
However,  as  our  experience  on  the  several  production 
systems  should  indicate,  it  may  not  b«  trivial  to  con¬ 
struct  the  production  system  to  get  the  recency  result. 
He  may  find  that  it  works  just  as  well  on  all  members 
of  the  set,  if  we  fix  it  up  to  work  on  the  most  recent. 
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Wnercas  recency  seems  consistent  with  the  unreliability 
assumption  of  STM,  so  that  the  subject  night  trust  the 
most  recent  one  but  not  the  older  ones,  the  system  may 
not  be  able  to  Loll  the  two  situations  apart.  We 
mentiou  these  potential  difficulties  to  indicate  the 
gap  between  having  the  right  sort  of  theory  and  having 
it  deliver  the  right  predictions  in  detail. 

Continuous  Sternberg  Experiment 

Enough  work  has  been  done  with  the  Sternberg  para¬ 
digm  to  accumulate  a  number  of  experiments  whose  inter¬ 
pretation  appears  to  pose  extreme  difficulties.  One  of 
these  is  an  experiment  by  Sternberg  and  Scarborough 
(1969).  Unfortunately  it  has  not  been  replicated  nor 
extended,  but  it  is  still  worth  attempting  an  explana¬ 
tion  in  terms  of  the  present  theory. 

Briefly,  a  subject  was  given  a  fixed  positive  set. 
Then  he  was  tested  with  20  probes  in  sequence.  Exactly 
one  probe  was  positive  or. none  was.  The  time  between 
probes  was  70  ms,  so  the  entire  sot  of  20  probes  went 
by  in  under  1.5  seconds.-  The  subject  was  to  react  to 
the  positive  probe  in  the  usual  way.  The  result:  the 
reaction  time  was  identical  to  chat  in  the  basic  task, 
being  a  linear  function  measured  from  the  time  of  the 
probe,  with  a  slope  of  about  35  ms  and  an  intercept  of 
about  350  ms. 

This  result  is  extremely  difficult  for  search 
theories  to  deal  with.  Sternberg  and  Scarborough  erect 
an  ad  hoo  pipeline  processing  system  with  stages  for 
each  probe.  The  present  theory  produces  the  essential 
result  on  the  assumption  that  the  probes  trigger  the 
decoding  of  the  set,  thus  filling  STM  with  both  probes 
and  elements.  Due  to  the  unreliability  of  STM,  if  a 
hit  gets  made,  the  set  is  decoded  again  to  confirm  the 
hit. 

Figure  23  gives  a  production  system,  FS.CST1,  for 
the  continuous  Sternberg  task.  It  differs  somewhat, 
as  it  must,  from  FS.ST7,  the  production  system  for  the 
basic  task.  We  have  kept  the  names  of  productions  the 
same,  so  that  the  correspondence  is  evident.  Mostly, 
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Fig.  23.  PS.CST1:  PS  for  continuous  Sternberg  task. 


productions  drop  out.  Since  the  subject  has  the  6et 
in  LTM,  no  encoding  productions  are  needed  (though  they 
could  have  been  left  in  the  system).  Instead,  PD12  is 
modified  to  put  the  positive  set  into  STM,  either  on 
the  ready  signal  or  whenever  there  is  an  indication 
that  some  elements  Bight  be  lo3t  from  STM.  The  cues 
to  this  are  there  not  being  any  set  in  STM,  either 
undecoded — (SET)  ABS — or  decoded — (OLD  (SET))  ABS.17 
Thus,  the  system  dumps  sets  into  STM  at  every  indica¬ 
tion,  so  to  speak,  in  an 'attempt  to  avoid  losing  some 
elements  of  the  positive  set  from  SIM. 

Decoding  of  a  set  takes  place  whenever  there  is  a 
aet  in  STM  to  be  decoded  and  a  probe  to  initiate  it. 
Since  there  is  a  continuous  stream  of  probes  (once  they 
start) ,  decoding  takes  place  immediately  (and  produces 
snail  refractory  periods)..  The  task  itself  dictates 
the  removal  of  the  negative  response  production  (PD7) , 
since  the  vest  is  only  for  presence.  (Actually,  the 
production  system  could  have  been  expanded  to  say  NO 
at  the  end  of  the  sequence.)  The  positive  response 
production  (PD6)  is  modified  to  only  sense  an  identical 
probe  and  set  element  with  a  marked  probe  (with  MARK)., 
The  key  production  is  Pj,l.lt  which  responds  to  an 


I7The  vigilant  reader  will  notice  an  error  in  the 
figure,  namely  the  AND  missing  between  two  condition 
elements  of  PD12.  The  interpreter  does  not  in  fact 
require  the  AND.  Thus  it  behaved  correctly,  eo  that 
the  error  uas  not  noticed  until  later. 
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Fig.  24.  Sun  of  PS.C3T1 
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identical  probe  and  set  element  by  parking  the  probe 
and  reinitializing  the  positive  set.  this  realizes 
the  checking  assumption. 

Figure  24  shows  a  run  of  PS.CST1  with  a  two  element 
set,  consisting  of  (ELM  4),  to  be  matched  to  the  probe, 
and  (ELM  A) ,  the  irrelevant  one.  The  executive  for  the 
run  was  modified  so  that  it  erne  to  the  console  on 
almost  every  other  action.  At  35  a*  per  action,  this 
approximated  a  70  am  interstimulus  duration.  The 
experimenter  forced  an  element  into  STM  at  each  of 
these  times,  starting  with  READ!  and  than,  after  a 
slight  wale,  a  sequence  of  probes.  Examination  of  the 
run  shows  that  It  reacts  to  (PKOBE  4)  appropriately, 
narking  It,  going  through  another  decode  and  responding 
YES,  despite  the  fact  that  other  probes  are  being 
entered  throughout.' 

The  system  deals  with  the  main  effect  In  an 
appropriate  way.  It  would  appear  to  have  a  slightly 
higher  intercept,  which  was  not  found  in  Che  experiment. 
However,  this  is  an  uncertain  measure,  since  the  abso¬ 
lute  value  of  the  intercept  is  always  contaminated. 

Also,  a  somewhat  higher  error  rate  might  be  expected, 
due  to  the  chances  uf  missing  the  match  with  PD1.1  if 
the  probe  arrives  and  STM  has  just  lost  the  key  set 
element.  However,  experimentally  the  error  rate 
remained  low.  It  is  possible  that  the  scheme  of  PS.CST1 
it  in  fact  relatively  reliable,  but  it  requires  more 
exploration  than  has  been  dons. 

A  Difficult  Exparimnt 

The  impression  should  net  be  that  the  theory  is 
unchallenged.  The  total  set  of  Sterr.nsrg-like  exper¬ 
iments  is  too  diverse  for  that.  For  inatv.ee ,  the 
theory  appears  to  have  great  difficulty  with  enocher 
experiment  reported  by  Sternberg  (1970).  The  positive 
set  (digits)  is  stored  in  LTX  and  its  transmission  into 
STM  is  held  in  abeyance  by  an  auxiliary  STM  task  of 
remembering  a  sec  of  letters.  Sometimes  the  subject 
gets  a  probe  digit  co  classify  aa  in  the  positive  set 
or  not.  Sometimes  he  gets  a  signal  to  repeat  the  letter 
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set,  which  helps  to  assure  that  he  attends  to  rhe 
letter  set  prior  to  the  signal.  The  result  Is  a  slope 
of  about  twice  that  of  the  formal  paradigm  (which  was 
run  as  a  control) — namely,  80  ms  versus  40.  The  inter¬ 
cept  is  also  higher  by  about  1*00  ms  in  the  experimental 
situation. 

Sternberg  interprets  tbe  higher  slope  as  being  due 
to  the  time  to  transmit  the  post  five  set  from  LTM  to 
STM,  which  is  a  close  analog  of  the  decoding  hypothesis. 
The  difficulty  Tor  the  present . theory  is  that,  if  this 
is  s  decoding,  then  the  slope  should  be  exactly  the 
same  as  in  the  control  esse,  since  both  have  Involved 
one  act  of  decoding.  Alternative  interpretations  ore 
always  passible,  but  none  has  occurred  that  cooes  close 
'to  resolving  this  experimental  result. 

Conclusion 

Let  us  sum  up  what  we  have  done  in  this  paper. 

(1)  We  introduced  the  notion  of  a  control  structure. 

(2)  We  introduced  a  general  class  of  systems — 
production  systems — that  could'serve  as  models  of  the 
human  control  system'.  (3)  We  developed  in  detail  a 
specific  production  system— PSG — which  incorporated 
assumptions  about  the  structure  of  the  human  infor¬ 
mation  processor.  (4)  We  exercised  the  theory  on  the 
basic  Sternberg  binary  classification  experiment,  which 
led  to  an  additional  psychological  assumption — the 
decoding  hypothesis.  (5)  We  pursued  in  lesser  detail 
some  other  applications— the  memory  span  and  some 
aspects  of  the  experiments  in  Posner's  paper. 

Our  intsnt  throughout  has  been  jointly  substantive 
and  methodological  and  we  have  nixed  the  rvo  thoroughly. 
In  the  remainder  of  the  conclusion  we  will  attempt  to 
sort  out  the  main  points  and  Issues. 

Production  System  as  Theories 

Production  systems  offer  an  explanation  of  human 
behavior  at  the  information  processing  level  (Newell  & 
Simon,  1972).  They  ere  only  one  of  many  forms  of  pro¬ 
graming  system  that  can  be  used  to  describe  behavior 
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-in  infotnatiniv  processinfi  terns.  As  wo  have  ooen  in 
PSG,  the -production  system  itself  has  become  the  car¬ 
rier  of  tlie  basic’ psychologi cal  assumptions — tha  system 
architecture  of  1'SG  i.i  taken  to  be  the  system  archi¬ 
tecture  of  the  human  information  processing  syscea. 

Itr  this  roi  poet'  these  systems  represent  an  evolution 
beyond  programming  language  systems,  such  as  LISP,  1PL, 
SNOBOL  food  even  more,  ALGOL  and  FORIKAM) .  In  these 
earlier  ayetcas  the  programming  language  was  an  essen¬ 
tially  neutral  affair,  designed  for  the  user  to  write 
his  specific  systems.  Introduction  systems,  as  rep- 
presented  by  PSG,  any  particular  set  of  productions 
represents  a  possible  momentary  performance  organisation 
of  a  human  subject. 

The  evolution  to  a  theory-laden  programming  lan¬ 
guage,  to  use  a  term  of  Fylyshyn,  appears  to  me  a 
major  advance.  By  the  name  coin,  however,  tha  language 
is  not  neutral,  so  that  variations  in  the  psychological 
theory  imply  variations  in  the  programming  system,  A 
moment's  reflection  will  show  how  vide  is  the  potential 
variation  in  system  architecture.  The  STM  can  be  run 
according. to  many  disciplines:  last-in  first-out,  as 
now;  first-in  first-out,  which  preserves  order;  random 
replacement  in  a  fixed  set  of  addressable  colls;  a  cir¬ 
culating  loop,  which  provides  another  font  of  rehearsal, 
etc.  The  matching  rules  can  be  varied:  re  multiple 
variables  in  the  condition;  only  single  levels  in  the 
condition  (not  nested  expressions);  no  recognition  of 
absence;  etc.  The  operations  can  be  varied:  a  decoding 
operation  that  simply  dumps  the  contents  into  STM, 

7ather  than  the  encoding  operation  os  now;  etc.  The 
selection  of  productions  can  be  varied:  more  than  one 
satisfied  production  producing  s  psychologically  mean¬ 
ingful  conflict  state;  evocation  of  a  production  leading 
to  an  automatic  refractory  state  that  inhibits  re¬ 
evocation  immediately;  etc.  The  timing  model  can  be 
varied:  parallel  processing  in  the  action  sequence; 
matching  time  dependent  on  the  elements  in  the  satis¬ 
fied  condition. 

Listing  many  alternatives  emphasizes  that  FSG  is 
only  one  member  of  the  class  of  psychologically  relevant 
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piixlu-  t  i.ua  systems.  Despite  till;:  variety,  production 
systems  at;  a  c'J.ism  incorporate  some  psychological 
assumpl ions  dial  seem  highly  plausible.  One  is  the 
reeogn i /e-act  cycle  of  activity  in  which  the  human 
continually  ivcogni zcs  some  features  in  the  situation 
and  act;:  accordingly.  Another  is  making  the  locus  of 
the  condition  correspond  to  those  aspects  of  the  sit¬ 
uation  that  the  subject  is  momentarily  aware  of,  and 
the  ident if i cation  of  this  .as  the  relevant  short  term 
memory.  Yet  another,  though  it  applies  to  a  somewhat 
narrower  class  of  systems,  is  theTncorporation  of 
encoding  into  all  STM  processing,  not  simply  as  an 
added  mechanism. 

The  structure  of  production  system  models,  as 
we  have  described  them  here,  are  seriously  deficient 
in  several  respects.  They  do  not  model  the  perceptual 
component,  including  the  various  buffer  memories  and 
the  control  interface  between  perceptual  structures 
and  the  contents  of  STM  (see  Newell,  1972).  They  do 
not  model  LTM,  especially  the  acquisition  of  new  infor¬ 
mation.  We  took  the  concents  of  LTM  as  consisting  of 
productions,  but  never  defined  the  way  new  productions 
were  to  be  created.  They  do  not  model  the  motor  appa¬ 
ratus,  including  the  control  interface  to  the  contents 
of  STM  and  the  actions  of  productions.  These  missing 
aspects  cripple  the  model  with  respect  to  many  phenom¬ 
ena,  though  there  is  no  reason  why  the  model  should  not 
be  extended  appropriately. 

Completeness 

Production  systems,  like  other  programming  systems 
and  mathematical  theories,  are  complete  in  the  sense 
of  producing  theoretical  consequences  that  arc  deduc¬ 
tions  from  the  theory.  We  are  interested  also  in  com¬ 
pleteness  of  another  sort.  Is  the  theory  complete  for 
the  phenomena  of  interest?  Does  it  provide  a  vehicle 
of  sufficient  richness  and  scope  to  model  what  appears 
to  need  modeling?  Production  system  models,  like  other 
so-called  simulation  models,  seem  to  have  this  complete¬ 
ness.  This  is  often  expressed  by  saying  that  they  per¬ 
form  what  they  model.  Thus  PS.ST7  not  only  is  a  theory 
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of  binary  i  I  ass  j  f  i  cation;  it  can  do  binary  classifi¬ 
cation.  As  long  as  the  interest  of  the  psychologist 
remains  focussed  on  the  performanco-of  the  task,  includ¬ 
ing  its  behavioral  details,  a  production  theory  claims 
theoretical  coverage  (though  of  course  it  can  he  dead 
wrong  in  its  predictions). 

It  is  useful  to  compare  this  situation  with  some 
of  rhe  other  techniques  we  currently  use  for  describing 
our  processing  theories.  As  commented  upon  in  the 
companion  paper  (Newell,  this  volume,  Chapter  6),  the 
theoretical  structure  of  work  on  the  immediate  pro¬ 
cessor  has  been  dominated  by  the  classification  of 
mechanisms.  We  have  serial  versus  parallel,  exhaustive 
versus  self-terminating,  attentive  versus  preattentive, 
and  so  on.  Such  terms  hold  low-level  generalizations 
resulting  from  the  experimental  studies.  Suppose 
PS.ST7  were  the  actual  mechanism.  Is  the  human,  then, 
a  serial  or  a  parallel  system?  It  appears  to  be  para¬ 
llel  on  selecting  productions,  serial  on  executing 
micro-sequences  of  actions,  parallel  on  examining  STM, 
serial  on  the  order  of  that  parallel  examination  as 
revealed  by  shielding  of  one  STM  clement  by  another. 

Is  its  search  exhaustive  or  self-terminating?  Within 
a  given  task  there  are  production  systems  of  each  type. 
Slightly  more  complex  systems  would  yield  strategies 
that  mix  the  type  of  search  conditionally  within  a  given 
trial.  Is  something  pre-actentive  or  attentive?  We 
found  it  hard  to  ascertain  that  aa  well.  The  point  is 
not  that  a  given  system  docs  not  give  rise  to  classifi¬ 
cations.  The  present  system  has  sharp  distinctions, 
e.g. ,  between  the  use  of  STM  and  of  the  variable  memory, 
or  between  sequences  of  actions  and  the  evocation  of  a 
sequence  of  recognitions  on  STM.  The  point  is  that  the 
existing  classifications  don't  seem  to  help  much  in 
describing  more  complete  systems. 

Flow  diagrams  have  become  a  primary  vehicle  for 
expressing  theories  of  processing,  and  they  represent 
a  substantial  advance  on  the  simple  classification  of 
mechanisms.  There  is  an  example  in  the  paper  by  Cooper 
and  Shepard  (Chapter  3)  in  the  present  symposium,  which 
summarizes  well  a  processing  structure  that  might  give 
rise  to  their  experimental  results. 
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What  is  tin-  relationship  between  production  systems 
and  flow  diagrams  as  they  are  used  in  the  psychological 
literature?  The  flow  diagram  provides  n  precise  model 
of  control  flow — of  what  follows  what.*0  It  provides 
a  frame  within  which  informal  specification  of  oper¬ 
ations  can  be  made  (the  little  descriptive  phrases  that 
go  in  the  boxes) .  It  does  not  provide  any  way  of  dis- 
cipiinlng  the  structures  so  built  up.  As  noted,  the 
operations  themselves  are  informal.  Sometimea,  as  In 
some  of  the  diagrams  in  Sternberg  (1970) ,  the  boxes 
appear  so  elementary  as  to  be  well-defined  (c.g. ,  a 
comparator,  a  match  register,  etc.),  but  in  fact  the 
flow  diagram  still  remains  Informal. 

More  important  from  the  present  view,  there  la  no 
discipline  on  the  control  structure.  There  are  neither 
primitives  of  control,  nor  ways  of  determining  that 
additional  apparatus  or  processing  must  occur  to  effect 
control.  The  effect  of  this  is  to  make  the  flow  dia¬ 
gram  unique  to  each  task.  It  must  of  course  be  unique 
in  some  way  since  the  tasks  arc  different.  But  there 
is  then  no  way  to  assert  when  two  different  flow  dia¬ 
grams  represent  the  same  processing  mechanism. 

The  production  system,  on  the  other  hand,  provides 
a  complete  set  of  primitives  and  determines  what  auxil¬ 
iary  control  processing  is  necessary  to  perform  a  task. 
This  comparison  between  tasks  is  possible.  This  is  not 
a  peculiar  property  of  production  systems,  of  course, 
but  is  true  of  any  programming  system.  Writing  programs 
In  SNOBOL  or  FORTRAN  would  do  as  well,  methodologically, 
except  that  their  underlying  structure  does  not  mirror 
reasonable  psychological  assumptions  about  the  human 
system  architecture. 

The  virtue  of  the  flow  diagram  is  that  It  expresses 
clearly  the  independence  and  ordering  of  stages  derived 
experimentally  by  careful  design  (e.g.,  Sternberg,  1969). 
Flow  diagrams,  by  their  very  Incompleteness,  do  not 


* °Bcsides  flow  diagrams,  which  show  control  flow, 
block  diagrams,  which  show  data  flow,  are  also  used. 

The  remarks  of  this  section  apply  equally  well  to  both. 
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over-commlt  cliei r  user  to  more  thou  what  the  data  say. 
Thus  they  are  good  for  sumn.-ir i  z  i  ng  experimental  data, 
at  Lhe  Name  time  that  they  arc  weak  for  constructing 
theory. 

j'ne  Problem  of  Methods. i 

Variability  over  subjects  comes  In  large  part  from 
the  variation  in  the  methods  (strategy,  program,  ...) 
they  use  for  a  task.  Thit.  is  conjectural,  of  course, 
but  much  evidence  supports  it.  A  major  contribution 
of  a  detailed  theory  of  control  is  to  make  possible 
the  proper  posing  of  the  question  of  what  method  a 
subject  used  for  a  given  task.  It  does  this  by  provid¬ 
ing  the  space  of  all  methods  (based  on  the  constants 
of  system  architecture  and  the  primitive  operations) 
for  a  subject.  Thus,  the  problem  of  discovering  the 
method  takes  the  form  of  a  programming  problem.  As  we 
illustrated,  there  are  often  many  solutions,  i.e.,  many 
production  systems  that  perform  the  task,  but  these 
can  be  generated  and  analysed,  and  scientific  reasons 
found  for  selecting  one  over  another  within  the  limited 
set.  This  is  a  quite  different  situation  chan  currently, 
where  anything  seems  possible  in  discussing  what  might 
go  in  a  subject's  performance. 

This  formulation  of  the  problem  of  methods  comes 
not  just  from  the  use  of  a  precise  language  (e.g.,  a 
simulation  language).  It  comes  from  the  identification 
of  the  space  of  all  programs  defined  by  the  system  with 
the  space  of  all  programs  feasible  for  the  subject. 

A  theory  of  control  is  more  important  to  analyzing 
methods  than  just  another  aspect  of  the  total  system 
necessary  to  complete  specification.  Much  of  what  goes 
on  in  information  processing  is  control.  Almost  every 
operation  in  a  large  complex  program  does  nothing  except 
arrange  things  so  something  else  can  do  something.  This 
appears  to  hold  for  both  humans  and  computers.  Tor 
Instance,  Dansereau  (1969)  found  it  to  be  true  of  humans 
doing  mental  multiplication  (e.g.,  36  x  152).  The  times 
for  the  additions  and  multiplications — the  productive 
part  of  the  process,  so  to  speak — played  a  small  role 
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compared  to  Liu*  tinea  for  fixation,  operand  positioning, 
etc.  The  same  is  certainly  true  of  the  theory  us  devel¬ 
oped  in  this  paper.  The  decoding  hypothesis  Is  In  fact 
a  form  of  the  same  magicians  trick,  in  which  the  actions 
that  take  time  are  not  the  apparently  productive  part 
(the  iterated  test  for  identity),  but  a  preparatory 
piece  of  housekeeping.  In  short,  methods  ore  mostly 
control,  so  that  any  theory  of  methods  must  operate 
within  an  explicit  theory  of  control. 

The  Problem  of  Scope 

How  to  construct  theories  that  range  over  a  wide 
diversity  of  tasks  is  a  major  issue  for  psychology. 

To  do  so  would  seem  to  require  a  theory  that  was 
specific  about  those  aspects  of  structure  and  content 
that  in  fact  were  used  in  common  in  diverse  tasks.  A 
detailed  theory  of  the  control  structure  would  seem 
to  offer  this,  since  it  specifies  the  common  archi¬ 
tecture  and  the  boundaries  within  which  a  taok-speci f i c 
method  can  be  sought. 

The  evidence  we  have  presented  that  production 
systems  will  indeed  make  a  major  contribution  to  this 
issue  is  still  meager.  In  thi3  paper  we  applied  the 
theory  only  to  a  couple  of  tasks.  The  original  pro¬ 
duction  system  was  applied  to  a  puzzle,  a  much  vaster 
task  than  any  discussed  here,  and  there  are  some  other 
applications  in  Newell  (1972).  The  PSG  production 
system  by  Kiahr  (Chapter  11)  In  this  volume  provides 
one  more  example. 

All  these  efforts  provide  evidenclTTanly  about  half 
the  issue.  They  show  that  it  is  relatively  easy  to 
construct  a  theory  in  a  new  task  environment  that  is 
responsive  to  the  empirical  issues  in  that  environment. 
One  obtains,  us  well,  strong  comparability.  For 
instance,  Kiahr’ s  counting  production  system  can  be 
examined  in  conjunction  with  the  Sternberg  one  here. 

In  an  important  sense  they  are  the  same  system,  since 
they  both  use  PSG  and  therefore  make  the  same  assump¬ 
tions  about  underlying  structure.  However,  the  con¬ 
stants  of  the  time  model  differ.  Kiahr  also  uses 
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replacement  operators — (X  “■*>  Y)  replaces  the  symbol  X 
in  an  element  with  the  symbol  Y — whereas  the  model  here 
uses  only  the  encode  operator,  (**) .  This  loads  to  a 
quite  different  style  of  programming.  Some  of  his 
conditions  are  very  long  and  raise  question;;  about 
whether  constraints  should  exist  on  the  size  or  com¬ 
plexity  of  conditions. 

This  collection  of  production  systems  does  not 
constitute  a  coherent  theory  for  the  set  of  tasks 
involved.  To  do  so,  they  must  be  melded  together  into 
a  single  production  system  that  performs  all  the  tasks, 
corresponding  to  the  total  organization  of  a  single 
human.  Such  a  production  system  will  have  productions 
that  are  unique  to  each  task.  But  it  must  face  scrutiny 
about  using  disparate  mechanisms  for  common  operations. 
It  must  also  handle  the  instructional  problem,  since 
something  in  the  environment  must  select  out  the  per¬ 
formance  relevant  to  Che  tank  at  hand.  The  interaction 
of  the  instructions  with  the  task  performance  program 
In  as  much  centra]  to  control  as  tho  Internal  part  of 
the  performance  program.  It  is  predictable  that  a  full 
fledged  theory  of  task  Instruction  will  be  required. 

I  stress  the  creation  of  a  single  production  sys¬ 
tem  to  i u|H  uduiii  the  unified  performance  on  a  sot  of 
tasks.  This  seems  to  me  the  only  way  to  validate  a 
iliuory  of  control-  Wo  saw  in  tho  dJeeuealun  of  the 
bnsic  Sternberg  paradigm  that  many  degrees  of  freedom 
were  available,  though  they  showed  up  as  alternatives 
in  method,  rather  than  freedom  of  parameter  settings. 
This  arises  primarily  because  the  datum  taken  from  s 
single  trial  Is  so  small  {i.e-y  overall  ttittioii  time) 
compared  to  the  complexity  of  the  system  that  generates 
it.  To  compensate,  behavior  In  many  disparate  task* 
must  be  obtained,  so  that  finally  the  mechanisms  and 
methods  being  used  become  uniquely  identified.  My  own 
personal  estimate  is  that  a  model  of  the  control  struc¬ 
ture  should  claim  to  handle  some  dozens  of  diverse 
experiments  before  it  is  a  genuine  contender.  The 
present  theory,  though  promising,  still  has  a  ways  to 
go. 
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It  should  be  noted  In  passing  that  the  theory 
refers  to  individual  performance  with  a  specific 
Method.  .  Thus  all  forms  of  aggregation  raise  the  spectre 
of  averaging  over  disparate  methods,  hence  producing 
Mixed.' estimates.  Thus  one  is  driven  towards  collecting 
^md  reporting  data  only  on  individual  subjects,  and 
even  there  not  averaging  disparate  perf ornancea . 

The  Prospeote  for  thio  Partiaular  Theory 

As  noted,  .the  present  theory  Is  only  nascent.  A 
few  words  might  be  said  shout  its  prospects.  Hissing 
from  the  model  as  it  stands  is  a  theory  of  error.  The 
theory  makes  only  time  predictions.  Errors  are  Indeed 
possible  in  the  system,  due  to  incorrect  programs  and 
to  limited  STM.  Both  of  these  sources  are  important 
in  some  task  environments.  Neither  of  them  appears 
to  provide  the  errors  that  occur,  say,  in  a  Sternberg 
paradigm.  The  current  theory  has  implicit  in  it  a 
model  of  error,  but  whether  it  will  work  out  is  not  yat 
clear.  It  is  worth  stating  because  it  transforme  the 
theory  in  an  interesting  way. 

Take  STM  as  having  indefinite  length  hut  being 
aufficicntly  unreliable  so  that  there  is  an  Increasing 
probability  of  an  element  disappearing  entirely. 

Whether  this  is  decay  with  time,  with  activity  or  what 
not  ie  secondary.  The  fate  of  each  element  is  somewhat 
independent  so  that  early  ones  can  disappear  before  lat~ 
er  ones.  This  is  the  primary  error  source,  from  which 
error  propagates  to  all  casks  according  to  the  strategy 
with  which  the  subject  operates.  Such  a  strengthening 
of  the  unreliability  assumption  will  reinforce  the 
encoding  hypothesis,  so  that  all  tasks  must  be  dealt 
with  by  encoding.  The  role  of  STM  becomes  one  of  hold¬ 
ing  a  few  items  after  decoding  (dumping  into  STM)  to  be 
picked  up  quickly  by  coupled  productions,  end  of  holding 
a  few  items  strung  out  prior  to  encoding  lrco  a  new 
chunk.  Thus  the  short  term  capacity  is  not  the  length 
(or  expected  length)  of  STM,  but  is  composed  from  the 
sire  of  codes  and  the  space  for  their  decoding.  For 
example,  a  short  term  capacity  of  seven  might  occur  via 
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a  chunk  of  three  and  four,  with  the  STK  nolding  four 
items  reliably  enough  to  get  them  decoded  and  emitted. 
Thus,  no  memory  structure  exists  in  the  system  that 
has  a  capacity  of  seven.  In  particular  the  STM  would 
appear  to  be  misnamed. 

As  we  have  already  mentioned,  Che  theory  is  miss¬ 
ing  perceptual  mechanisms,  effector  mechanisms  and  a 
good  theory  of  LTM  acquisition.  All  of  these  are 
serious.  The  question  of  how  to  acquire  new  productions 
seems  to  me  the  most  serious  of  all.  In  parr  thia  is 
because  we  know  it  to  be  a  hard  problem,  whereas  the, 
others  appear  to  be  sinroly  aspects  that  have  not  re¬ 
ceived  their  share  of  attention. 

All  existing  theory  is  delightfully  vague  on  the 
mechanism  of  I.TM  acquisition.  It  is  tied  somehow  to 
amount  of  residence  in  STM,  measured  either  by  time  or 
by  rehearsals.;  But  what  is  stored  is  left  unspecified. 
Proposing  to  create  a  new  production  makes  clear  that 
decisions  (by  the  system)  must  be  made  about  both  con¬ 
ditions  and  actions..  The  condition  is  essentially  the 
access  path.  The  action  is  essentially  the  content, 
though  it  consists  of  both  passive  content  (elements 
to  STM)  and  active  content  (operators) .  Since  thsre 
is  good,  though  indirect,  evidence  that  humans  do  not 
have  voluntary  control  of  the  acquisition  process  (i.e., 
operators  for  constructing  productions,  which  can  be 
part  of  actions),  there  must  be  some  more  automatic 
process  for  learning.  Its  structure  is  a  puzzle. 

The  fate  of  the  decoding  hypothesis  is  extremely 
uncertain.  The  appeal  of  an  indirect  non-obvious 
explanation  of  a  major  regularity  in  behavior  must  be 
resisted.  There  are  an  Immense  number  of  studies  whose 
interpretation  seem  straightforward  in  terms  of  linear 
search.  Until  the  decoding  hypothesis  is  shown  to  be 
compatible  with  many  more  of  these  than  the  present 
paper  has  considered,  the  hypothesis  should  be  taken 
as  a  strictly  secondary  challenger.  However,  the 
emphasis  that  it  gives  to  the  processes  of  coding  snd 
decoding  seems  certainly  on  the  right  track. 
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